The Data Engineering Developer is responsible for the design, build and deployment of a project's Data Engineering components. A typical Data Warehouse or Data Lake effort usually involves multiple Data Engineering Developers developing the Informatica mappings, executing them to move and/or transform data and validating the initial results. These tasks involve data ingestion into storage technologies such as Hadoop, NoSQL, Cloud Storage or other storage mediums. Data Integration and Data Quality mappings will often be built and performed in these storage technologies as data is consolidated and harmonized. Data extraction may also be required as data from the Data Warehouse or Data Lake is used to feed a number of reporting, regulatory and analytic purposes.
Reports to:
- Technical Project Manager
Responsibilities:
- Uses Informatica Data Engineering to extract data from external sources and ingest them into storage technologies such as Hadoop, No SQL, or Cloud Storage
- Uses Informatica Data Engineering to perform data integration operations on data within the selected storage medium
- Uses Informatica Data Engineering to perform data quality operations on data within the selected storage medium
- Develops Data Integration workflows and load processes.
- Ensures adherence to locally defined standards for all developed components.
- Performs data analysis for both Source and Target tables/columns/files to determine data movement, integration and quality needs
- Provides technical documentation of Source to Target mappings.
- Participates in design and development reviews.
- Works with System owners to resolve source data issues and refine transformation rules.
- Ensures performance metrics are met and tracked.
- Writes and maintains unit tests.
- Conduct QA Reviews.
Qualifications/Certifications
- Understands data integration processes and with a working knowledge of the chosen Data Lake or Data Warehouse storage technology
- Has a basic understanding of relevant data integration skills such as primary and foreign key relationships, transactional vs domain data, data refresh patterns and data recovery processes
- Possesses excellent communications skills
- Has the ability to develop work plans and follow through on assignments with minimal guidance
- Has Informatica Data Engineering Platform and Informatica data integration mapping experience or experience with a similar object-based data integration technology
- Has the ability to work with business and system owners to obtain requirements and manage expectations
Recommended training
- Corresponding Data Storage technology training based on your organizations chosen platform (examples: Hadoop, Kafka, NoSQL, Snowflake, Redshift, Databricks, etc.)
- Informatica University: Big Data for Developers
- Informatica University: Big Data Streaming for Developers
- Informatica University: Informatica Developer Tool for Big Data Developers
- Informatica University: Data Quality: Data Quality Management for Developers
For more details on Informatica courses go to Informatica University Curriculum.