Scaling S3 Parquet Scanning Performance using EMR

This webinar is intended for customers who run a cloud data lake on Amazon S3, specifically those storing Parquet files with partitions and complex data types such as struct and array.
Here is the agenda for this session:
  • Use-cases on S3
  • Data Source Preparation
  • Solution architecture
  • Scanner configuration
  • Q&A
Speaker Details

Srinivasa Gopal, Principal Customer Success Technologist


The presenter for this session is Srinivasa Gopal, an Informatica veteran on the Customer Success Technologist team. Srinivasa works on Data Governance and Privacy solutions and has developed a specialty in Cloud Data Management solutions.