Loading…
June 21-24, 2022
Austin, Texas, USA + Virtual
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for Open Source Summit North America 2022 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in Central Daylight Time (UTC -5). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Tuesday, June 21 • 4:05pm - 4:45pm
Real-Time Analytics: Going Beyond Stream Processing with Apache Pinot - Rong Rong & Karin Wolok, StarTree

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
Apache Kafka forms the backbone of the modern data pipeline and its stream processing capabilities provide insights on events as they arrive, but what if we want to go further than this and execute analytical queries on this real-time data. The OLAP databases used for analytical workloads traditionally executed queries on yesterday's data with query latency in the 10s of seconds. The emergence of real-time analytics has changed all this and the expectation is that we should now be able to run thousand of queries per second on fresh data with query latencies typically seen on OLTP databases. This is where Apache Pinot comes into the picture. Apache Pinot is a realtime distributed OLAP datastore, which is used to deliver scalable real time analytics with low latency. It can ingest data from streaming sources like Kafka, as well as from batch data sources (S3, HDFS, Azure Data Lake, Google Cloud Storage), and provides a layer of indexing techniques that can be used to maximize the performance of queries. Come to this talk to learn how you can add real-time analytics capability to your data pipeline.

Speakers
KW

Karin Wolok

Head of Developer Community, StarTree
Karin is Head of Developer Marketing and Community for StarTree, a start-up founded by the original creators of Apache Pinot. From a B.A. in broadcasting and a background in major entertainment and event production companies, she started exploring tech fields and discovered her love... Read More →
RR

Rong Rong

Software Engineer, StarTree
Rong is a software engineer from StarTree. He is passionate about building data analytics, machine learning & stream processing platforms; and hacking on various OSS frameworks and tools. Prior to StarTree, Rong worked as software engineer in Facebook, Uber and LinkedIn; and practiced... Read More →


Tuesday June 21, 2022 4:05pm - 4:45pm CDT
Room 408/409 (Level 4)