META DATA REPOSITORY SHOPPING LIST
by William Laurent

Deciding on a 'best of breed' enterprise metadata repository for your organization can be a time consuming and politically charged affair. Significant bi-directional and extensible integration of metadata from heterogeneous sources to a common environment will be a major task, with many stakeholders--all of whom may have their own visions of what a centralized and distributed repository or repository application suite should be.

Here are a few "checklist" topics that may jump-start your repository needs assessment and subsequent procurement efforts.

What are the Repository's Strengths? Things to Look for:

  • Powerful versioning capabilities help manage and administrate complex configurations of data.
  • Powerful extensibility.
  • Large array of scanners/interfaces, many of which are bi-directional.
  • Strong data stewardship capabilities--for instance: metadata groupings, impact analysis, searching, etc.
  • Strong support of systems development standards.
  • Can accommodate unstructured data. Places no limitations of types of metadata that can be included in repository.
  • Repository client allows for the identification, shopping, and linking to a single business element across the enterprise. This is called data name rationalization, i.e. the automated identification of synonyms utilizing matching criteria, and is driven from a powerful data name glossary list.
  • Sound separation of the repository platform from the database.
  • A solid web-enabled meta data browser: Meta data 'publishing' can be highly customized.
  • Inclusion of a search engine that provides potent meta data and document indexing capabilities.
  • Cataloging of pointers to unstructured metadata present no limitation on what kind of metadata can be managed within the repository, providing true distributed meta data.
  • Underlying relational nature of the repository enables reporting and analysis against the repository using 3rd party SQL compliant tools.
  • Underlying file management system is proprietary and non-relational [can be good or bad] and does not have to rely on external resources to dictate the details of the internal architecture or API's, promoting greater efficiency in design and responsiveness to market conditions.
  • Ease of extension and customization because of the separation of the repository engine from the database.
  • Interactive GUI for quick utilization of the repository, offering interactive and updateable graphic representations of data. A bundled web-enabled application can change query and manipulate data in the actual repository in real time.
  • Supports XML as the standard for meta data interchange, with the goal of progressively reducing the need for customized support of tool-specific environments. Accompanying modeling tool can make XML DTDs and custom repository information models (RMI).
  • A true 3-tier/n-tier solution using browser based clients. May also allow applications to treat the repository contents as a large XML document, which can be viewed in various ways using XML stylesheets.
  • Multitude of customizations possible. Repository of functions and services for C++, VB, and other 3GLs.
  • Effective security model.
  • Supports all standard RDBMS and operating system platforms.

What are the Repository's Limitations? Items to Consider:

  • Proprietary nature of file management system means reporting and analysis against the repository cannot be accomplished using third party SQL-compliant query tools, negatively impacting the accessibility of the metadata.
  • Administration complexity.
  • Lack of BI tool interfaces and scanning.
  • Unstructured data, such as Word documents, are not able to be physically stored in the repository. Instead a directory pointer is maintained in the repository to a physical location on the LAN.
  • Limited OS support, specifically UNIX/LINUX or AS400.
  • Read-only data shopping.
  • Lack of vision or corporate instability of vendor.
  • Rough interfaces and steep learning curve.
  • No easy migration capabilities (i.e. promotion path/gateway from test to production) for meta data.
  • Some degree of proprietary dispensation toward vendor specific ETL tools, making 3rd party ETL tool integration more difficult.

Buying a meta data repository may offer an attractive alternative to a homegrown one built from scratch. It is vital that IT management know what to look for when shopping for an extensible and scalable repository. The preceding checklist is merely a good starting point for the formulation of questions and issues that will help an organization determine the best repository for their needs.

 
     
Copyright © 2005 William Laurent, Inc. All Rights Reserved.