June 21-24, 2022
Austin, Texas, USA + Virtual

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for Open Source Summit North America 2022 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is displayed in Central Daylight Time (UTC-5).

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Thursday, June 23 • 4:55pm - 5:35pm
Uncovering Community and Project Insights Through Data Driven Methods - Oindrilla Chatterjee & Karanraj Chauhan, Red Hat

How do you assess and track the health of your project in an open source, data-driven way? GitHub repositories reveal crucial information about a project (velocity, blockers, community health, and so on) that can ultimately help guide its development. In this talk, we show how to get deeper insights into the software development process by creating a set of open source data science workflows that collect data about a repository's pull requests, analyze them, and visualize key metrics on a dashboard. We also show how to build and deploy ML models that can supplement the project development process. Specifically, we show how to create these dashboards and services on an open source community cloud, where data scientists get an environment for solving data science challenges without having to set up the infrastructure and services themselves. Because the cloud's operations are also open source, data scientists have full visibility into the platform and their workloads every step of the way. By the end of this talk, attendees will have learned how to use this set of ML tools to derive key metrics from their GitHub repositories, and how to use the open source tools to create reproducible notebooks, serve models as services, and build automated pipelines and dashboards.
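As a rough illustration of the "collect, analyze, and derive key metrics" step described in the abstract, the sketch below computes two simple pull request metrics: merge rate and median time to merge. It is not taken from the speakers' repository. The function names `hours_to_merge` and `summarize` and the sample records are hypothetical, and the `created_at` / `merged_at` field names and timestamp format are assumed to follow what the GitHub REST API returns for pull requests.

```python
from datetime import datetime
from statistics import median

# Assumed timestamp layout, e.g. "2022-06-01T10:00:00Z" (UTC).
TS_FORMAT = "%Y-%m-%dT%H:%M:%SZ"


def hours_to_merge(pr):
    """Return hours between creation and merge, or None if the PR was never merged."""
    if pr.get("merged_at") is None:
        return None
    created = datetime.strptime(pr["created_at"], TS_FORMAT)
    merged = datetime.strptime(pr["merged_at"], TS_FORMAT)
    return (merged - created).total_seconds() / 3600.0


def summarize(prs):
    """Aggregate a list of PR records into a small metrics dictionary."""
    durations = [h for h in map(hours_to_merge, prs) if h is not None]
    return {
        "total_prs": len(prs),
        "merged_prs": len(durations),
        "merge_rate": len(durations) / len(prs) if prs else 0.0,
        "median_hours_to_merge": median(durations) if durations else None,
    }


# Hypothetical sample data, invented for illustration only.
sample_prs = [
    {"number": 1, "created_at": "2022-06-01T10:00:00Z", "merged_at": "2022-06-01T14:00:00Z"},
    {"number": 2, "created_at": "2022-06-02T09:00:00Z", "merged_at": "2022-06-03T09:00:00Z"},
    {"number": 3, "created_at": "2022-06-03T08:00:00Z", "merged_at": None},
]

metrics = summarize(sample_prs)
print(metrics)
```

In a real workflow, the records would come from a collection step rather than a hard-coded list, and the resulting dictionary could feed a dashboard or a notebook cell.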

Git Repo - https://github.com/aicoe-aiops/ocp-ci-analysis
Contact - ochatter@redhat.com, kachau@redhat.com


Shachi Vaman Khadilkar

Student, University of Massachusetts-Lowell

Oindrilla Chatterjee

Senior Data Scientist, Red Hat
Oindrilla is a Senior Data Scientist at Red Hat in the Office of the CTO, working on emerging trends and research in ML and AI. She works on evaluating new tools, platforms, and methodologies in the open source data science ecosystem for enhancing Red Hat products and internal services...

Thursday June 23, 2022 4:55pm - 5:35pm CDT
Room 408/409 (Level 4)