Alexander N. Podobry, Vladimir A. Maklaev

Ulyanovsk State Technical University, Russia

Approach to Uniform Integration of Corporate Data Warehouses for Designing of Computer-Aided Systems

 

An integrated information system of Company management shall have, as an information basis, a common distributed database where every user with the requisite access rights may promptly receive data from different information sources. The designing of effective and secure information infrastructures is to be based on applying of formal models for metadata structuring, functioning and isolation of access to information resources.  

The integration of corporate data warehouses requires the designing of a common cloud to represent the information on implementation of a product-lifecycle stage, and the designing of an experience base for a Design Organization. For this reason, a federated architecture for data-storage is to be designed. This type of architecture will allow isolating data warehouses taking into account the needs and the purpose of computer-aided and information systems, and combining them due to a common structure of interconnections. To design this architecture, basic components of information and computer-aided systems are to be sorted out, that is, corporate data warehouses are to be uniformed.  The uniform representation of corporate data warehouses and their descriptions in metadata will allow automating at maximum the design of services to perform sorting out and to process data sets which meet the defined conditions and constraints. The basis for the suggested integration of information resources of computer-aided systems forms the data sets, which are stored in independent corporate data warehouses.

The information system, per se, is a set of objects interacting with each other. On that premise, the following main object types can be sorted out:

- documents – an information, fixed in the information system;

- reference books – an object for storage of data with the same structure and in list-form;

- Registers – tables for on-line data accumulation and a summary acquisition;

- Constants – reserved values for variables.

Each object type includes the list of superclasses, i.e., components of the same type. A superclass includes components describing this object. These components may refer to a source object and represent the dependence between a subclass’s components and another component, which is the superclass one. On this basis, a diagram of dependencies between information objects is proposed (fig.1).

Fig. 1 Uniform Structure of Information Systems

A majority of information and computer-aided systems measures up to the represented description in terms of “document” and “reference book”.     

Thus, in the given example of “1C 7.7” Corporate Information System, the document “Product Order” has the field “Product” which refers to the reference book “Products” where objects refer in turn to another reference book, etc.

There are three main tables for metadata storage at the database level that are suggested (fig.2):

         - META_SYSTEM_PARAM – list of information systems

         - META_OBJECT – list of objects of information systems and their interconnections

         - META_DICT – reference book for interpretation of metadata-structure characteristics

Fig. 2 Structure of Metadata at Database Level

The reference book for lists of information systems includes the name, description and accounting data to link to a system’s data warehouse. The table of a list of information systems’ objects represents an undirected graph wich has, as a point, an object of the information system, and, as an edge, a linkage between objects.

On the basis of this metadata structure, a query to data sets may be formed taking into account the structure for writing queries in Transact SQL. The major components of this language are: Select, Join, and Where (fig. 3).

Fig. 3 Structure for Designing SQL-Query Based on Metadata

The component Select is formed on the basis of the fields, discribed in META_OBJECT table, and is interpreted in LINK_OBJ attribute. The linkage table Join is formed based on the reference fields LINK_DICT è LINK_DICT_COL.  The parameter table Where includes the conditions applied to the fields’ sets to limit the output of data. After the major components of the query generator are populated, a SQL-function appears, allowing appplying functions to manupulate data sets. As a result of the described manupulations, we have the final Transact SQL query text referring to independent corporate data warehouses.   

The full information on the final query is stored in the table of metadata and represents the fields’ sets and their linkage with other objects, both within a corporate data warehouse, and with another warehouse (fig.4).

Fig. 4 Storage of Information at Database Level

The remaining two components of the query design may be applied to the final query: the storing and grouping tables are implemented as in the Transact SQL language.

Thus, taking into account the peculiarities of a manufactuing enterprise, the result of integration may be shown as the data representation module which gives information both to design the experience base for the Design Organization that is implemented on the basis of a set of facilities, called WIQA (Working In Questions and Answers) [4, 5] which is to serve the workflows “Experience Interaction” at the stage of a concept design of a CAS (computer-aided system); and to track the product lifecycle.  This module receives information from the main corporate data warehouses on the basis of metadata (fig. 5).

Fig. 5 Structure for Integration of Uniform Corporate Warehouses

The data representation module may be shown as a service allowing sorting out and processing of data according to the predefined conditions and constraints. Based on the module’s structure, implemented on the basis of MS SQL, data may be represented both in a form of data sets and as Web-pages.

These services represent an accumulated “knowledge base”, i.e.,  data library available for everybody’s use. That is, evey task under solution may represent a groundwork for other tasks. Thus, the approach given meets main requirements of SOAs (service-oriented architectures). The result of the service work may be represented by “data marts”, i.e., a set of the data required to resolve a particular task.

To this point, the corporate data warehouses have been uniformly integrated, at that, the warehouses’ objects have been included into the metadata structure. We have represented a diagram of data integration by example of a manufacturing enterprise.    The data representation module implemented on the basis of MS SQL Server DBMS (database management system) has been described. The structure for data storage allows using OLAP data representation methods and including new independent data warehouses into this structure.

 

BIBLIOGRAPHY

1.     ISO/IEC Std 2382-1

http://www.morepc.ru/informatisation/iso2381-1.html

2.     Lisin, Nikolay. G. Patchwise Automation or How to Manage a “Zoo” of Software Programs //BYTE Publishers, 2009

http://www.bytemag.ru/articles/detail.php?ID=14862

3.     Voroysky, Felix S. Informatics. New Methodical Reference Dictionary (Introduction to Informatics and Computer Science in Form of Terms). – 2nd Revised and Enlarged Edition, – “Liberia” Publishers, 2001.

4.     Maklaev, Vladimir A., Andrey A. Pertsev.  Regulations    Professional Maturity of a Design Organization. – Ulyanovsk: Ulyanovsk State Technical University, 2012 – 300 p.

5.     Sosnin, Petr I. Question-and-Answer Programming of a Man-Machine Interaction. – Ulyanovsk: Ulyanovsk State Technical University, 2010 – 240 p.