Information Maturity in Large Organisations through Business Semantics: a Business Case at the Flemish Public Administration
Published: June 1, 2010
This article describes the support of the Flemish Ministry of Education and Training (FMET) in their next big leap toward information maturity. Using a Metadata Roadmap, we combine metadata technology, methodology, culture, and organization that will allow FMET to semantically unlock their earned information and so revalue it.
Adequate information management requires more than persistently storing data, a function databases already provide. Despite the rigorous structure that may have been imposed on data, if it cannot be disclosed to third parties, their value is practically zero. The ICT-outsourcing partnership between the Flemish Ministry of Education and Training (FMET) and EDS-Telindus, now HP, recognizes the need for data governance. This is shown by initiatives such as the development of a Data Warehouse and an Information Governance Organization.
In this article, we describe the support of the Flemish Ministry of Education and Training (FMET) in their next big leap towards information maturity. Using a Metadata Roadmap, we combine metadata technology, methodology, culture, and organization that will allow FMET to semantically unlock their earned information and so revalue it.
Closed World SyndromeThe ability to unlock data is related to the ability to understand the hidden information in it. We aim to give a meaningful answer to questions about the following mission critical data:
Just-in-Time InformationTo answer the above questions, the ICT-outsourcing partnership aim at just-in-time information (JITI), we define as the ability to interpret the latent information in exchanged data in the right context and in a timely manner, without the help of the original designer. The need for JIT information follows from the fact that information forms arguments during strategic decision making. Consider, for example, the minister who asks a report on the possible influence of the mother’s educational profile on her children’s school performance. People are in constant need for relevant JITI in order to analyse this correlation and finally to allow the minister to make well-founded political decisions. This calls for a pragmatic approach.
Metadata ManagementFor JITI to be effected, the ICT-outsourcing partnership is convinced that they, as earlier for data itself, must pay attention to the structured recording and publishing of data about data, also called metadata. The Flemish regulation1 describes metadata as:
"Documentation that describes the content and frequency of updating an authentic data source, and technical manner in which that resource can be unlocked.”
This definition remains vague when it comes to defining metadata, but clearly hints that metadata should have a dual utility (Akkermans and Gordijn). On one hand, it should refer to a formal specification for computer-based applications. On the other hand, it should refer to real-world objects in the business domain.
Enterprise applications, such as master data management and business intelligence claim to provide a total solution for metadata management. In practice, however, they produce redundant or anomalous metadata because they do not take into account the interest of other applications (again the closed world syndrome). Furthermore, the metadata is of a mere technical nature, which at best provides only a partial answer to those questions mentioned earlier. Indeed, as will see, there are inherent limits on just how much technology can really help in this regard.
In current metadata management practices, the underlying methodological principles are usually ignored (Cardoso, 2007). Moreover, as proven by the narrow focus on technical metadata, they systematically ignore the subtle gap that looms between (i) knowledge sharing between people on the business/social level (based on business metadata), and (ii) data exchange between computerized information systems at the operational/technical level (based on technical metadata).
According to De Leenheer et al. (2010), the basic principle of metadata management lies in capturing the co-evolution of (a) social knowledge sharing and information needs that emerge from it, (b) the supporting computerized information systems, and (c) the metadata that enables data exchange between these systems. Doing so, it imposes a conceptual bridge between business (metadata) and technology (metadata).
Business DriversBased on our interviews, within the ICT-outsourcing partnership metadata management is fueled from seven business drivers.
"Sometimes people do not know what data is out there. Knowing the whereabouts of data starts with many calls within and between business units over and over again.'' (Anton Derks)
Business (intelligence) metadata is key to the provision of relevant documentation. Employees must document their data in a systematic manner in order to minimize the loss of know-how in case they would leave the organization. This is especially important in the context of the ICT-outsourcing partnership in which a regime of high turn-over dominates. Business Analysts must define relationships between documents that document the reasoning of their strategic advices. The collective practice of documentation will progressively decrease the overhead brought by repetitive calls for often the same latent documentation.
"No communication without metadata!'" (Jan Dejonghe)
Metadata facilitates communication both internally as externally. For example, vendors of administrative school software should be able to understand the semantics the software should obey when checking in enrolment data with FMET. This can be formulated in the form of a technical data specification (e.g., in UML or XSD) that was generated from business metadata that is on its turn compliant to certain administrative regulations. Doing so, metadata provides a "language" that can cope more effectively with communication problems between business and the in-and outflow of external ICT consultants.
Metadata accelerates the retrieval of assets and promotes their reuse. Currently assets, including reports, queries, data, architecture, technology, and licenses are defined ad hoc. In the planned service-oriented architecture, metadata will facilitate the retrieval and reuse of software services.
Codification strategy where data is codified in a common format that makes it easy to exchange and reuse is not the silver bullet. Personalisation complements this via right tooling and culture, which allows next to codification, the emergence of personal networks, engendering reputation among its peers. Therefore, it is equally important to address the provenance of data. If one knows the owner of a data asset, one can use the personal network (with Web 2.0 tooling) to gain additional know-how through socialisation. p147
"You hear now and again, ‘they have changed something to the data table, and now it does not work anymore.’''(Frans Decuyper)
Metadata is crucial to capture complex dependencies between different systems, people and applications, and to calculate the impact in the event of a change transparently. A precise impact analysis allows a more precise cost-benefit analysis. Moreover, it reduces the likelihood that they are subsequently surprised by unexpected side effects. The ICT-outsourcing partnership as is often can't see the forest for the trees.
"There are 180,000 teachers, more than one million students and thousands of educational institutions. FMET forms a large part of our society, hence it is certainly important that labels are attributed the same meaning.'' (Martin Maesen)
Metadata helps to get rid of inconsistencies or ambiguities. It is very valuable to know that a term has an unambiguous meaning. For example, the term "family" is fairly easy to understand for most people, but in the software application it has a strict sense, inferred from the legislation. Another problem is caused by (naturally occurring) homonyms. For example, if one would check all decrees ranging from primary to higher education, the word “study area” is often attributed contradictory meanings, while one would expect this in FMET frequently used term should be intuitively obvious. There are also concepts that are poorly defined and require additional interpretation. An example is a term "part exemption” that is applied in regulation but apart from that has few leads to its definition. This brings about the question whether we should take the definition into account are not. These are issues that continually recur in discussions with institutions about changing data models (“yes, but… what about the part exemptions''). Several discussions are repeated over and over again, and would be unnecessary if the relevant terms would be disambiguated properly.
"In my early years I have worked on a glossary: the most difficult part was to obtain consensus." (Marleen Deputter)
Unambiguous metadata is not sufficient. It is crucial that metadata uniformly applies to the entire organization and its stakeholders. It is very valuable for everyone to be sure that what is said in one place, is also valid elsewhere. A major source for the lack of uniformity is the wave pattern that FMET follows when it evolves: first, at the business level, there is a new decree (or set of related decrees). Next, this decree is "implemented" in a computer application. Inherent to its nature, legislation may change, giving rise to an organic growth in the implemented applications. An example is their Salary System and more specifically the lack of meaningful codes that are contained therein.
Metadata plays an important role in regulatory compliance, a field that is not much explored so far (Ryan et al., 2004):
Metadata Landscape SWOT AnalysisAs with information management (Maes, 1999), the strategy, structure, and operationalisation of metadata management is not trivial because it must be able to align the complex and rapidly evolving information needs to the ditto business needs.
Technological support for metadata management in terms of software is necessary, but certainly not enough. Therefore we distinguish, alongside technology, three other dimensions in the metadata landscape: methodology, organization, and culture. Crucial is the development of a teachable and repeatable methodology consisting of a number of coordinated methods and techniques that allows the organization to perform different metadata management activities effectively and efficiently. This methodology should be anchored in an organizational arrangement of social roles and responsibilities, appropriate to the establishment of the Information Governance (see Decuyper, 2009). Finally, in order to apply the methodology properly, it is of great importance that there is a right culture of joint understanding, managing and using metadata in the cross-process information chain. In the context of knowledge management, [Kankanhalli) indicated that social incentives brought about by cultivation are essential to increase the usage of tools and methods.
Designing a solution requires a metadata landscape analysis that deepens each of the four dimensions, and consequently focuses on the internal weaknesses and external threats. To this end, we have interviewed more than twenty people on both sides of the partnership. Based on our findings, we performed a strength-weakness (SWOT) analysis (De Leenheer et al., 2009).
The resulting SWOT diagram is shown in Figure 1. An important strength is that metadata culture is already fairly mature. This is shown by the intention of its management, the curiosity of its employees (taking grass root initiatives providing success stories), and the targeted conceptual training. Another important strength is the outsourcing partnership that already successfully exists for many years.
On the contrary, the weakness lies in the fact that the current metadata technology, methodology, and organization are not adequate. Other threats are the constantly evolving policy changes that impede a smooth long-term rollout of information management. This is aggravated by the high turnover, we already discussed. Moreover, if FMET does not act fast, there is a threat that other public administrations will enforce their own metadata standards, making FMET a mere dependent entity.
The opportunities include the growing availability now of commercial semantic technology that can be deployed to gain information maturity; and the demand for exchange of experiences, especially strong in the context e-government. Moreover, the ICT-outsourcing partner is present throughout all ministries, which provides a wide platform for exchange of best practices.
Figure 1: SWOT Analysis of the Metadata Landscape (De Leenheer et al., 2009)
Business Semantics ManagementBusiness Semantics Management (BSM) (De Leenheer et al, 2010) provides methodology, technology, culture and organization that enable parties to (i) obtain consensus on (the semantics of) key business concepts, and (ii) apply this consensus uniformly throughout the organization. Respectively, BSM consists of two complementary cycles: semantic reconciliation and semantic application (see Figure 1) that each group a number of activities.
Figure 2: Business Semantics Management consists of Two Complementary Cycles: Semantic
Reconciliation and Semantic Application. Both Cycles Communicate via the Unify-Activity.
Contrary to what some middleware vendors claim, it is practically impossible to implement and maintain a central "metadata repository." There are several reasons for this:
Business Semantics GlossaryBSG provides a single point of reference for FMET’s business vocabulary and rules. To this end, we adopt OMG's Semantics for Business Vocabulary and Business Rules (SBVR) standard.
The BSG supports the semantic reconciliation process.
Figure 3: Screenshot of the Term "Vestigingsadres" (English: home address) in the
Business Semantics Glossary. Even Though the Concept Definitions Look Like Natural Language. One Can Automatically Generate an Enterprise Information Model from it that Provides a Formal Specification.
Enterprise Information ModelAn enterprise information model (EIM) is a "flattened" version (e.g., in UML or XSD) of the BSG that is generated in a timely manner. This turning point in the BSM methodology from reconciliation into application of the business semantics happens during the unification activity. The EIM serves as a uniform technical specification to implement semantic applications. We distinguish two activities.
Figure 4: Full-Cycle Business Semantics Management
Metadata Architecture and GovernanceOf particular importance is that BSM becomes structurally and gradually embedded in FMET’s Enterprise Architecture by means of a Metadata Architecture. The basis for this is the EIM. Metadata Governance is concerned with establishment, modification, and implementation and monitoring of the Metadata Architecture by using BSM so that the ensuing business information systems will optimally contribute to the desired business results. While fueled by business drivers, the implementation of BSM is determined by a metadata charter, principles and policies.
ConclusionThe embedding of BSM in FMET requires a planning to implement a coherent set of projects in line with the ICT Strategic Plan 2010-2014. Eventually this should bring the partnerships information management in 2014 to an acceptable level. BSM constitutes a powerful catalyst to align and fuel just-in-time information management processes from the business and supporting technical data management processes. Doing so, ICT can be used effectively and efficiently.
The yardstick that we use to measure information maturity is the Information Maturity Model (IMM)2 (Figure 5). From implemented proof-of-concept we have reviewed some aspects BSM. If we project our findings from these PoCs on the IMM, we conclude that FMET was at IMM level 2 at the beginning of our analysis, and that the organization is not far off from achieving level 3. The five-year plan aims level 4.
Figure 5: Information Maturity Model: A Practical Yardstick to Qualify Information Maturity (by courtesy of Sean McClowry).
Achieving Level 4 IMM will provide a platform with many new capabilities such as the development of Semantic (Business) Intelligence and Semantics-Driven SOA. Moreover, the outreach of best-practice applications and associated metadata deliver a unique reputation to FMET. For example, the European ISA program tries via its Semic.eu3 platform to promote sharing and standardization of metadata for public administrations.
Recent articles by Stan Christiaens
Stan Christiaens - Stan Christiaens is co-founder and operational director at Collibra, a data governance enterprise software company. As such, he has a global responsibility for all technical pre-sales, implementation and support activities. This allows him to have a front-seat view on real customer demands, issues and implementation challenges. Prior to founding the company, he was a senior researcher at the Free University of Brussels STARLab, a leading semantic research center in Europe, performing application-driven research in semantics. As such, he participated actively in several international (ITEA, FP6, FP7) research projects and conferences (OTM, FIS, ESTC). Stan has also published various articles in the field of ontology engineering. He is an active DAMA member and speaker at DAMA events in Europe, and he was recently one of the best received speakers at the IDC SOA event in London with his presentation on “Business-Driven SOA.”
Pieter De Leenheer, Ph.D. - Dr. Pieter De Leenheer is assistant professor at VU University Amsterdam in the Business, Web and Media group (since November 2009). He is also co-founder and research director of Collibra NV/SA, a Brussels-based semantic technology spin off from the Vrije Universiteit Brussel (VUB), Belgium. From 2002-2009, Pieter was senior scientist at VUB STARLab and lecturer at the same university.
Pieter holds a PhD in computer science on community-based ontology evolution, and a MSc in principle computer science. Pieter authored more than 30 publications in various books, international journals and conferences, among which he co-edited the Springer book Ontology Management for the Semantic Web. He gives master lectures including Database Theory, (Web) Information Systems, and Semantic Web languages. He is member of ACM and IEEE.
Aldo de Moor - Aldo de Moor is owner of the CommunitySense research consultancy company. The firm's mission is to link academic researchers and practitioners in the rapidly advancing field of community informatics, and to translate state-of-the-art insights into practical solutions for clients. Aldo earned his PhD in information management in 1999, from Tilburg University in the Netherlands. From 1999-2004, he was an assistant professor at the Department of Information Systems and Management at Tilburg University. From 2005-2006, he was a senior researcher at the Semantics Technology and Applications Research Laboratory (STARLab) of the Vrije Universiteit Brussel in Belgium.
Aldo was a visiting researcher at the University of Guelph in Canada and the University of Technology in Sydney, Australia. Aldo was program co-chair of the International Conference on Conceptual Structures, the Language/Action Perspective Working Conference on Communication Modeling, and the Pragmatic Web Conference. His writings have appeared in journals such as Communications of the ACM, Data and Knowledge Engineering, Group Decision and Negotiation, Information Systems, Information Systems Frontiers, and Information Systems Journal.