
- Learning Path
- Enterprise Data Catalog: Intermediate
Informatica Enterprise Data Catalog (EDC) provides business and IT users with powerful semantic search and dynamic facets to filter search results, data lineage, profiling statistics, 360-degree relationship views, data similarity recommendations, and an integrated business glossary. This enables IT users to be more productive and business users to be able to be full partners in the management and use of data. You can now easily and efficiently manage enterprise data assets to maximize their value throughout the company. Business users can quickly find data and easily manage the lifecycle of business terms, definitions, reference data, and more.
The Intermediate level constitutes of many 'how-to' documents, whitepapers and videos that will help you understand EDC installation, configuration, resource management, monitoring, profiling, similarity discovery and many such concepts.
After you successfully finish all the three levels of EDC product learning, you will earn an Informatica Badge for EDC. So continue on your learning!
This module covered topics on installation and configuration and getting started with EDC. The section also walked you through the architecture, resource management, configuring security, Business Glossary, Custom Model, Resource Management, Scanners, Schedule Management, Monitoring, and Resource Security.
This module also covered EDC Profiling, Data Domains, Similarity Discovery, Attribute Management, classifications and Connection Assignment that includes creating a connection, refreshing the connection list, viewing a connection, configuring pooling for a connection, editing and testing a connection and deleting a connection.
You also learned about Synonyms, how to add synonyms to your search, Lineage, Lineage Export, custom lineage options, how to get detailed lineage in EDC for PowerCenter mappings and Intelligent Business Glossary Association.
Now move on to the Advanced level for your EDC product journey and get to know more about the product.
Contents
Go through the following documents and Knowledge Base articles to understand Enterprise Data Catalog(EDC) installation and configuration
EDC Installation and Configuration
How to Enable SSL for Enterprise Data Catalog (EDC) 10.5
List of ports that should be open for Informatica Cluster Service (ICS) in EDC 10.5
HOW TO: Configure Custom SSL Certificates for Enterprise Data Catalog 10.5 (SP1)
HOW TO: Validate Cluster Prerequisites with Validation Utility in Enterprise Data Catalog 10.5.1
HOW TO: Configure SSL for ICS in EDC 10.5
Installing Advanced Scanner post-EDC installation in 10.5 and validating the license.
Check out this article to learn more.
EDC Upgrade Resources:
The above webinar is intended for all Enterprise Data Catalog administrators who would like to learn more about the requirements to upgrade to the new EDC 10.5 version in light of the upcoming 10.4 End of Life (EOL).
EDC Upgrade Planner: This update provides the latest ecosystem & connectivity support, security enhancements, cloud support, and performance enhancements while improving the user experience.
This webinar is intended to give you a heads up on the EDC 10.5 release. This session will allow users to familiarize themselves better with the release and provide more insights into its architecture and functionality.
This document explains the high-level architecture of a domain on multiple nodes.
This document explains the Enterprise Unified Metadata Architecture and describes the architecture components.
This article lists the significant changes included in EDC 10.5 architecture.
This video explains how to create a new resource to extract metadata from an Oracle database, and run data profiling and domain discovery during the extraction process followed by a demo.
Click here to read the Enterprise Data Catalog Scanner configuration guide.
Browse through the following articles and documents to learn more about resource management:
HOW TO: Configure snowflake resource
Check out this video, article, and document to learn how to configure snowflake resources.
HOW TO: Configure Sybase resource
Here's an article and document to know how to configure Sybase resource.
Other Resource Management Articles:
HOW TO: Configure Oracle cluster resource
HOW TO: Configure Oracle resource
HOW TO: Provide permissions for oracle resource scan
HOW TO: Configure JDBC MYSQL resource
HOW TO: Configure Azure Blob storage
Configuring Azure Microsoft SQL Data Warehouse in Enterprise Data Catalog
Configuring Amazon Redshift Resource
Configuring Amazon S3 Scanner in Enterprise Data Catalog
Configuring Axon Scanner in Enterprise Data Catalog
Configuring Workday Scanner in Enterprise Data Catalog
Configuring MDM Scanner in EDC 10.4.1 and above versions
Resource Management Articles:
HOW TO: Connect to SSO enabled Tableau server when configuring Tableau resource in EDC
HOW TO: Configure Postgres SQL resource
HOW TO: Configure BG Scanner with SSL
HOW TO: Configure HDFS resource
HOW TO: Configure DB2Zos scanner
HOW TO: Configure Cloud scanner in non SSL domain
HOW TO: Configure Cloud scanner
HOW TO: Configure SAP HANA Scanner
HOW TO: Configure a PostgreSQL resource in EDC
HOW TO: Configure Microstrategy Scanner
HOW TO: Configure AWS Dynamo DB JDBC Scanner
HOW TO: Configure Netezza Scanner
Resource Management Articles:
Parameter Files
Check out these articles to learn more about Parameter Files:
HOW TO: Create Parameter zip files to upload for PowerCenter scanner in EDC
Powercenter Parameter File Utility - FDD in EDC
Other Resource Management Articles
HOW TO: Create a Google Big Query connection and resource in EDC
HOW TO: Configure Service Principal for Microsoft Power BI in EDC
Resource types for tracking metadata changes
HOW TO: Install/configure EDC Agent on a windows machine for Qlikview resource type
HOW TO: Configure 10.5 Enterprise Data Catalog Agent for SSL
Installing Enterprise Data Catalog Agent
HOW TO: Filter tables and views from Relational source for the profiling in EDC
HOW TO: Use filter option for Source Metadata and Data Profile Filter in EDC
HOW TO: Scan an Azure SQL server database with Active Directory authentication in EDC
HOW TO: Scan Microsoft SQL server custom data type in EDC
HOW TO: Filter tables and views from Relational source for the metadata scan in EDC
The performance of Enterprise Data Catalog depends on the size of data that needs to be processed. Tuning the performance involves tuning parameters for metadata ingestion, ingestion database, search, and tuning data profiling. Use the Tuning Enterprise Data Catalog Performance in 10.5.1 guide for information about tuning Enterprise Data Catalog for optimal performance. Here's the Tuning Enterprise Data Catalog Performance in 10.5.1 Guide.
When you enable data discovery for a resource and scan the resource, Enterprise Data Catalog identifies profiling-related metadata, such as null values, distinct values, inferred data types, unique keys, and data domains in the resource. This article provides information on supported connections, authentication methods, troubleshooting, and performance metrics to run data discovery on resources.
In this video, you will see how to create reusable schedules. You will also be able to create a reusable schedule to assign multiple resources to the same schedule with the help of a demo in this video.
Additional Resources:
This video explains how to monitor EDC based on resources and tasks. You will also learn how to use the Catalog Administrator to monitor resources and track the status and schedules of tasks.
Additional Resources:
HOW TO: Monitor the loaded/created resource in Live Data Map
HOW TO: Check the Nomad Job status using Nomad Command line for EDC 10.5
In this video, you will see how to secure client records in an enterprise or an organization by assigning specific permissions only to authorized users with the help of a demo. You will learn to configure specific permissions and default permissions on resources for users and user groups that are configured in the Informatica domain.
Additional Resource:
Go through the following documents to learn more about profiling in EDC.
HOW TO: Get the list of excluded tables and views for profiling in EDC
HOW TO: Filter tables and views from Relational source for the profiling in EDC
This video explains how to create a data domain. You will learn to create a data domain to discover the functional meaning of data in the data sources based on the semantics of the data with the help of a demo.
In this video, you will see how to curate data domains in bulk. You will also learn how to validate and manage discovered metadata of a data source using Curation so that the metadata is fit for use and reporting with the help of a demo.
In this video, you will see how to create a composite data domain named customer to include details of the customer such as name, email, and gender. You will also learn to create composite data domains using rules with existing data domains and new data domains with the help of a video.
Read more about Data Domains and Data Groups
Read more about Data Discovery
HOW TO: Create Reusable Configuration for Data Discovery in EDC
As a data analyst or data architect, you can scan your enterprise data to find similar columns. This document will guide you on column similarity that includes how column similarity works, column similarity process, business examples and to propagate business terms.
Click here to read about Similar Columns, Value Frequency, and Permissions for Value Frequency
This video demonstrates the process of using Python to read the contents of Excel, lookup the equivalent catalog object, and update the catalog if necessary with the help of a video.
Additional Resource: EDC - Bulk import custom attributes
This video explains how to use Catalog Administrator to edit the pre-defined system attributes that are extracted as metadata from source systems. With the help of a demo, the video explains how to configure the predefined attribute named author.
In this video, you will see how to create a custom attribute and how to use that attribute. You will learn to create custom attributes that you want EDC users to add to the search filters and include these custom attributes in the catalog.
Additional Resources:
HOW TO: Create Custom Attributes in EDC
HOW TO: Create a custom attribute with an array of pre-defined values in EDC
HOW TO: Search by custom attribute
Go through the following Knowledge Base articles to learn about classifications in EDC:
This video explains how to assign a schema to the connection as well as multiple connection assignments. You will also learn to manage connections between resources and schemas by assigning, unassigning, or reassigning connections to resources.
Additional Resources:
HOW TO: Perform connection assignment for multiple connections in EDC
In this video, you will see how to add synonyms to your search. You will also learn how to import synonym data into the catalog and search for an asset using a synonym with the help of a demo.
Additional Resource:
Go through the following articles to learn about Lineage in EDC:
HOW TO: Carry out POC of a Custom Lineage in EDC
HOW TO: Create Column Level Lineage for EDC custom lineage scanner
Go through the following Knowledge Base articles to learn about Glossary Association in EDC:
Multiple Business term association
Intelligent Business glossary association
This session shows how to create an EDC custom attribute that is driven from a category created in Business Glossary.
This webinar will guide you through the key use cases of Enterprise Data Catalog (EDC) for your digital transformation and discuss the new capabilities of EDC 10.4.
Here's the first part of the two-part webinar series explaining Enterprise Data Catalog Advanced Scanners (MetaDex).
Here's the second part of the two-part webinar series explaining Enterprise Data Catalog Advanced Scanners (MetaDex).



Extracting the Profile and Domain Discovery Results from IDQ to EDC

Operationalize Data Governance with Axon, EDC and IDQ

EDC 10.5.x Advanced Scanner Overview and Best Practices


EDC 10.5.x Advanced Scanner Overview and Best Practices




Extracting the Profile and Domain Discovery Results from IDQ to EDC
