LinkedIn manages data from more than 700 million users worldwide and has learned to use this Big Data repository to power AI, Data Science, and Data Products. Over the years, its Big Data operations have evolved into a sophisticated infrastructure stack that includes:
- Data Ingestion
- Distributed Storage
- Privacy
- Scheduled Compute
- Interactive Compute
One of the ways that LinkedIn has kept pace with its growth and increased functionality is through Open Source technologies. Vasanth Rajamani, a Director at LinkedIn, will discuss:
- The role that Big Data plays at LinkedIn
- The infrastructure stack
- Examples of Open Source technologies in the infrastructure stack, including Apache Spark and the Apache Gobblin ingestion framework
Vasanth will tell you how you can engage with the Open Source community and evaluate Big Data as a student or as a mid-career professional. The live webinar will include a Q&A with Vasanth. (If you register and can’t make the live webinar, we will send you a link to the recording after the event.)
About the Speaker
Vasanth is a Director at LinkedIn, where he manages big data platforms and infrastructure (e.g., Hadoop, Spark, Deep Learning Training, Gobblin, Dali, Azkaban). His professional experience includes roles at Oracle, Microsoft, and Mi5 Networks.
He earned his BSEE at Purdue University and his MSEE and Ph.D. at the University of Texas at Austin.