- Learning Path
- Enterprise Data Catalog: Intermediate
Informatica Enterprise Data Catalog (EDC) provides business and IT users with powerful semantic search and dynamic facets to filter search results, data lineage, profiling statistics, 360-degree relationship views, data similarity recommendations, and an integrated business glossary. This enables IT users to be more productive and business users to be full partners in the management and use of data. You can now easily and efficiently manage enterprise data assets to maximize their value throughout the company. Business users can quickly find data and easily manage the lifecycle of business terms, definitions, reference data, and more.
The Intermediate level consists of many 'how-to' documents, whitepapers, and videos that will help you understand EDC installation, configuration, resource management, monitoring, profiling, similarity discovery, and related concepts.
After you successfully finish all three levels of EDC product learning, you will earn an Informatica Badge for EDC. So continue learning!
This module covered topics on installation and configuration and getting started with EDC. The section also walked you through the architecture, resource management, security configuration, Business Glossary, Custom Model, Scanners, Schedule Management, Monitoring, and Resource Security.
This module also covered EDC Profiling, Data Domains, Similarity Discovery, Attribute Management, Classifications, and Connection Assignment, which includes creating a connection, refreshing the connection list, viewing a connection, configuring pooling for a connection, editing and testing a connection, and deleting a connection.
You also learned about Synonyms and how to add them to your search, Lineage, Lineage Export, custom lineage options, how to get detailed lineage in EDC for PowerCenter mappings, and Intelligent Business Glossary Association.
Now move on to the Advanced level of your EDC product journey to learn more about the product.
Contents
Go through the following documents and Knowledge Base articles to understand Enterprise Data Catalog (EDC) installation and configuration:
EDC Installation and Configuration
EDC Installation: High-Level Overview
HOW TO: Validate EDC pre-requisites
HOW TO: Configure Ports in EDC
HOW TO: Switch from non-Kerberos to Kerberos
HOW TO: Switch from non-TLS enabled to TLS
FAQ: What are the prerequisites for IHS
HOW TO: Validate utility for IHS Prerequisites
FAQ: What are the IHS load types
HOW TO: Configure IHS on SSL enabled domain
HOW TO: Configure SUDO Permission
This Meet the Experts webinar helps you understand the EDC architecture better.
Additional Resource: This document describes the overall architecture of EDC and details the components that the architecture relies on. It covers the service-oriented architecture that EDC follows, Informatica domain nodes, application services, repository databases, and Hadoop Cluster Services.
This video discusses EDC 10.2.2 Architecture Updates, Graph Refactoring, and steps for the upgrade.
This video explains how to create a new resource to extract metadata from an Oracle database and how to run data profiling and domain discovery during the extraction process, followed by a demo.
EDC Resource Configuration: This article lists the permissions that you must configure for data sources before you configure resources in EDC. The article also lists the types of object metadata that the resources extract from the data sources.
EDC Prerequisites for Resource Configuration: This article provides information about the prerequisites for specific resources in Enterprise Information Catalog. For detailed steps and information about installation, see the Informatica Catalog Administrator Guide.
Browse through the following articles and documents to learn more about resource management:
HOW TO: Configure Snowflake resource
HOW TO: Configure Sybase resource
HOW TO: Configure Oracle cluster resource
HOW TO: Configure Oracle resource
HOW TO: Provide permissions for Oracle resource scan
HOW TO: Configure JDBC MySQL resource
HOW TO: Configure Azure Blob storage
HOW TO: Configure Salesforce resource
HOW TO: Configure Salesforce with Proxy
HOW TO: Configure DBScript scanner for BTEQ
HOW TO: Configure PostgreSQL resource
HOW TO: Configure BG Scanner with SSL
HOW TO: Configure HDFS resource
HOW TO: Configure DB2Zos scanner
HOW TO: Configure Cloud scanner in non-SSL domain
HOW TO: Configure Cloud scanner
HOW TO: Configure SAP HANA Scanner
HOW TO: Configure Greenplum JDBC Scanner
HOW TO: Configure MicroStrategy Scanner
HOW TO: Configure AWS DynamoDB JDBC Scanner
HOW TO: Configure Netezza Scanner
HOW TO: Configure Profile warehouse scanner
FAQs on Google BigQuery resource
Resource types for tracking metadata changes
Filter tables and views from Relational source for the profiling in EDC 10.2.2
Use filter option for Source Metadata and Data Profile Filter in EDC 10.2.2
Filter tables and views from Relational source for the metadata scan in EDC 10.2.2
This article provides information about tuning Enterprise Data Catalog performance, which involves tuning parameters for metadata ingestion, the ingestion database, search, and data profiling.
The performance of Enterprise Data Catalog depends on the size of data that needs to be processed. The article lists the parameters that you can tune in Enterprise Data Catalog and the steps that you must perform to configure the parameters based on the data size.
In this video, you will see how to create reusable schedules. A demo shows how to create a reusable schedule and assign multiple resources to it.
Additional Resources:
HOW TO: Schedule a resource in Live Data Map
This video explains how to monitor EDC based on resources and tasks. You will also learn how to use the Catalog Administrator to monitor resources and track the status and schedules of tasks.
Additional Resources:
HOW TO: Monitor the loaded/created resource in Live Data Map
In this video, a demo shows how to secure client records in an enterprise or organization by assigning specific permissions only to authorized users. You will learn to configure specific permissions and default permissions on resources for users and user groups that are configured in the Informatica domain.
Additional Resource:
Go through the following documents to learn more about profiling in EDC.
This video explains how to create a data domain. With the help of a demo, you will learn to create a data domain that discovers the functional meaning of data in the data sources based on the semantics of the data.
In this video, you will see how to curate data domains in bulk. With the help of a demo, you will also learn how to use Curation to validate and manage the discovered metadata of a data source so that the metadata is fit for use and reporting.
In this video, you will see how to create a composite data domain named customer that includes customer details such as name, email, and gender. With the help of a demo, you will also learn to create composite data domains using rules with existing data domains and new data domains.
As a data analyst or data architect, you can scan your enterprise data to find similar columns. This document guides you through column similarity, including how it works, the column similarity process, business examples, and how to propagate business terms.
Click here to read about Similar Columns, Value Frequency, and Permissions for Value Frequency
This video demonstrates how to use Python to read the contents of an Excel file, look up the equivalent catalog object, and update the catalog if necessary.
Additional Resource: EDC - Bulk import custom attributes
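The read, look up, and update-if-necessary flow described for the Python video can be sketched roughly as follows. This is only an illustration under stated assumptions, not the script used in the video: `CatalogClient`, `InMemoryCatalog`, `find_object`, `update_object`, and `sync_rows` are hypothetical names, not the EDC REST API, and the rows are plain dictionaries standing in for what a spreadsheet library would return after reading the Excel file.

```python
from typing import Dict, Iterable, List, Optional, Protocol


class CatalogClient(Protocol):
    """Hypothetical catalog interface (an assumption, not the EDC REST API)."""

    def find_object(self, name: str) -> Optional[dict]: ...

    def update_object(self, object_id: str, attributes: Dict[str, str]) -> None: ...


class InMemoryCatalog:
    """Stand-in catalog so the sketch runs without a server."""

    def __init__(self, objects: List[dict]) -> None:
        self._by_name = {obj["name"]: obj for obj in objects}
        self.update_calls: List[str] = []  # ids of objects that were written

    def find_object(self, name: str) -> Optional[dict]:
        return self._by_name.get(name)

    def update_object(self, object_id: str, attributes: Dict[str, str]) -> None:
        for obj in self._by_name.values():
            if obj["id"] == object_id:
                obj["attributes"].update(attributes)
                self.update_calls.append(object_id)


def sync_rows(
    rows: Iterable[Dict[str, str]],
    catalog: CatalogClient,
    key: str = "name",
) -> Dict[str, List[str]]:
    """Look up each spreadsheet row in the catalog and update only on change."""
    report: Dict[str, List[str]] = {"updated": [], "unchanged": [], "missing": []}
    for row in rows:
        name = row[key]
        obj = catalog.find_object(name)
        if obj is None:
            report["missing"].append(name)
            continue
        # Every column except the key is treated as an attribute value.
        wanted = {col: val for col, val in row.items() if col != key}
        changed = {
            col: val
            for col, val in wanted.items()
            if obj["attributes"].get(col) != val
        }
        if changed:
            catalog.update_object(obj["id"], changed)
            report["updated"].append(name)
        else:
            report["unchanged"].append(name)
    return report


if __name__ == "__main__":
    demo_catalog = InMemoryCatalog(
        [{"id": "o1", "name": "CUSTOMER", "attributes": {"Owner": "alice"}}]
    )
    demo_rows = [{"name": "CUSTOMER", "Owner": "bob"}]
    print(sync_rows(demo_rows, demo_catalog))
```

Comparing values before writing keeps the number of catalog updates small, and the returned report lists the spreadsheet rows that had no matching catalog object so they can be reviewed by hand.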
This video explains how to use Catalog Administrator to edit the predefined system attributes that are extracted as metadata from source systems. With the help of a demo, the video explains how to configure the predefined attribute named author.
In this video, you will see how to create a custom attribute and how to use that attribute. You will learn to create custom attributes that you want EDC users to add to the search filters and include these custom attributes in the catalog.
Additional Resources:
HOW TO: Create Custom Attributes in EDC
HOW TO: Create a custom attribute with an array of pre-defined values in EDC
HOW TO: Search by custom attribute
Go through the following Knowledge Base articles to learn about classifications in EDC:
This video explains how to assign a schema to a connection and how to perform multiple connection assignments. You will also learn to manage connections between resources and schemas by assigning, unassigning, or reassigning connections to resources.
Additional Resources:
HOW TO: Perform connection assignment for multiple connections in EDC
In this video, you will see how to add synonyms to your search. With the help of a demo, you will also learn how to import synonym data into the catalog and search for an asset using a synonym.
Additional Resource:
Go through the following articles to learn about Lineage in EDC:
How to get Detailed Lineage in EDC for PowerCenter mappings
Go through the following Knowledge Base articles to learn about Glossary Association in EDC:
Multiple Business term association
Intelligent Business glossary association
This session shows how to create an EDC custom attribute that is driven from a category created in Business Glossary.
This webinar will guide you through the key use cases of Enterprise Data Catalog (EDC) for your digital transformation, and also discuss the new capabilities of EDC 10.4.
This session discusses methods and processes for customers who have purchased Axon, EDC, and IDQ and want to get the most value out of the platform.
We will also share real-world best practices for operationalizing Data Governance.
Learn how to extract the profile and domain discovery results from Informatica Data Quality to Enterprise Data Catalog.
This webinar is intended for Data Practitioners, Data Architects, Data Stewards, Data Program Managers, Data Engineers, and Data Scientists who want to understand the new capabilities in the 10.4.1 release of EDC. This session will cover the benefits of each new feature, when to use it, and how best to use it. This session will also include feature demos. At the end of this session, you will have the knowledge to adopt these new capabilities in 10.4.1 to bring efficiencies to your data management practice.
This webinar is intended for business users as well as for Administrators to understand the importance of Axon-EDC integration. This webinar will also provide an overview of the features available.
You have massive amounts of data across hundreds of data sources, from complex enterprise applications and legacy systems to scripting languages and stored database procedures. To optimize its business value, you need to break down the silos, making this metadata available to your data catalog.
In this webinar, we will provide an introduction and deep dive into the new "Advanced Scanners" capabilities from EDC.
Specifically, you will learn:
The challenges of extracting metadata from complex enterprise systems
The benefits of using Informatica Enterprise Data Catalog (EDC) Advanced Scanners to simplify and accelerate metadata and data lineage extraction
How to extract metadata from the most complex systems, including Stored Procedures, Mainframes, SAS, and others
