Powering LinkedIn Big Data Through Open Source Technologies

About the Webinar

LinkedIn manages and uses data from over 700 million users worldwide, and it has learned to use this Big Data repository to power AI, Data Science, and Data Products. Over the years, its Big Data operations have evolved to use a sophisticated infrastructure stack that includes:

  • Data Ingestion
  • Distributed Storage
  • Privacy
  • Scheduled Compute
  • Interactive Compute

One of the ways that LinkedIn has kept pace with its growth and increased functionality is through Open Source technologies. Vasanth Rajamani, a Director at LinkedIn, will discuss:

  • The role that Big Data plays at LinkedIn
  • The infrastructure stack
  • Examples of Open Source technologies in the infrastructure stack, including Apache Spark and the Apache Gobblin ingestion framework 

Vasanth will tell you how you can engage with the Open Source community and evaluate Big Data as a student or as a mid-career professional. The live webinar will include a Q&A with Vasanth. (If you register and can’t make the live webinar, we will send you a link to the recording after the event.)

About the Speaker

Vasanth is a Director at LinkedIn. He manages big data platforms and infrastructure at LinkedIn (e.g., Hadoop, Spark, Deep Learning Training, Gobblin, Dali, Azkaban). His professional experience includes Oracle, Microsoft, and Mi5 Networks.

He earned his BSEE at Purdue University and his MSEE and Ph.D. at the University of Texas at Austin.

Hosted By


Simplilearn is one of the world’s leading providers of online training for Digital Marketing, Cloud Computing, Project Management, Data Science, IT, Software Development, and many other emerging technologies.
