BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Drupal//recurring_events_ical//2.0//EN
BEGIN:VEVENT
UID:1f41c629-0823-45ff-bba1-44857b43dd70@support.access-ci.org
DTSTAMP:20250401T185201Z
DTSTART:20250409T180000Z
DTEND:20250409T200000Z
SUMMARY:CI Pathways: HPC Data Science with Apache Spark I
DESCRIPTION:This session will introduce essential tools and techniques for 
 manipulating very large datasets. It will explore common challenges and pi
 tfalls encountered when migrating from more traditional databases and how 
 to mitigate them. The session will feature Apache Spark\, an open-source u
 nified analytics engine designed for large-scale data processing\, and int
 roduce the basics of how to use PySpark\, the Python API for Apache Spark.
  By the end of the session\, attendees will have a solid foundation in man
 aging large datasets and be prepared to tackle complex data processing tas
 ks.
URL:https://support.access-ci.org/events/7903
END:VEVENT
END:VCALENDAR