Blog: Mike Ferguson http://www.b-eye-network.co.uk/blogs/ferguson/ Welcome to my blog on the UK Business Intelligence Network. I hope to help you stay in touch with hot topics and reality on the ground in the UK and European business intelligence markets and to provide content, opinion and expertise on business intelligence (BI) and its related technologies. I also would relish it if you too can share your own valuable experiences. Let's hear what's going on in BI in the UK. Copyright 2009 Fri, 18 Dec 2009 03:34:47 -0700 http://www.movabletype.org/?v=4.261 http://blogs.law.harvard.edu/tech/rss Data Federation- Information Services Patterns - The On-Demand Information Services Pattern Following on from my last blog on data federation, the next data federation pattern I would like to discuss is a On-Demand Information Services Pattern. This is as follows:

 

Pattern Description

This pattern uses data virtualization to provide on-demand integrated data to applications, reporting tools, processes and portals via a web services user interface. Structured and semi-structured data sources are supported including RDBMS, any web service (internal or external), web syndication feeds, flat files, XML, packaged applications and non-relational databases.

 

Pattern Diagram

Blog-TheOnDemandInformationServicesPattern.JPG

 

Pattern Example Use Case

A company needs to different kinds of information services targeted at different role-based user communities for access via their enterprise portal.  These services include:

 

·         Internal operational and analytical information services

·         Information services that integrate structured and semi-structured information including internal and external syndicated web feeds

·         Information as a Service (IaaS) services that  render information in various XML formats (e.g. XBRL) for consumption by external users and applications

 

Reasons For Using It

Rapid development of re-usable information services for consumption by portals, composite applications, processes and reporting tools.

 

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/12/data_federation-_information_s.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/12/data_federation-_information_s.php Fri, 18 Dec 2009 03:34:47 -0700
Data Federation- Master Data Patterns - The Virtual MDM Pattern Following on from my last blog on data federation, the next data federation pattern I would like to discuss is a Master Data Virtual MDM pattern. This is as follows:

 

Pattern Description

This pattern uses data virtualization to provide one or more on-demand integrated views of master data entities such as customer, product, asset, employee etc. even though the master data is fractured across multiple underlying systems. Applications, processes, portals, reporting tools and data integration workflows needing master data can acquire it on-demand via a web service interface or via a query interface such as SQL.

 

Pattern Diagram

Blog-TheVirtualMasterDataManagementPattern.JPG

Pattern Example Use Case

A manufacturer needs to make sure that changes to its customer data are made available to its marketing, e-commerce, finance and distribution systems as well as its business intelligence systems to keep business operations, reporting and analysis running smoothly. A shipping group of companies needs to perform a routine maintenance upgrade on a particular type of asset. However, its assets are managed by different systems in multiple lines of business. In order to budget for this upgrade it needs to have a single view of assets to fully understand maintenance costs. 

 

Reasons For Using It

To obtain a single integrated views of master data for consistency across business operations quickly at a relatively low cost.

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/12/data_federation-_master_data_p.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/12/data_federation-_master_data_p.php Fri, 11 Dec 2009 09:24:33 -0700
Data Federation - DW Patterns - The Virtual Data Source Following on from my last blog on data federation, the next data federation pattern I would like to discuss is a Data Warehouse Virtual Data Source pattern. This is as follows:

Pattern Description

This pattern uses virtual views of federated data to create virtual data source components for use in ETL processing. The purpose of this pattern is twofold. Firstly to protect ETL workflows from structural changes to operational data sources. Secondly to create re-usable virtual data source 'components' for accessing disintegrated master and transactional data. The virtual data source pattern effectively 'ring fences' just the data associated with a customer, or a product for example, meaning that ETL workflows can be built for customer data, product data, asset data, order data etc.  This helps ETL designers to create ETL jobs dedicated to a particular type of data e.g. the customer ETL job, the product ETL job, the orders ETL job. Simplistic design of data consolidation workflows dedicated to a type of data allows these jobs to be re-used if the same data is needed elsewhere, e.g. customer data needed in two different data marts. It also guarantees that the same data is made available again and again via the same virtual data source  

 

Pattern Diagram

Blog-TheVirtualDataSourcePattern.JPG

Pattern Example Use Case

Merger and acquisitions and new system releases often cause changes to operational systems data structures. This pattern can be used to shield ETL jobs that populate data warehouses and master data hubs from structural changes to source systems simply by changing the mappings in the virtual source views.

 

Reasons For Using It

Reasons for using this pattern include the ability to manage change more easily, lower ETL development and maintenance costs and modular design of data integration workflows associated with consolidating data.

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/12/data_federation_-_dw_patterns_2.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/12/data_federation_-_dw_patterns_2.php Fri, 04 Dec 2009 03:51:23 -0700
External Data Feeds BI To the Front Office Everywhere I look at the moment I see my clients talking about needing to benchmark themselves against the market, to understand customer and prospect sentiment on social networking sites and to understand competitors in much more detail. It is not just me that has recognised this need. It also seems that new young startup companies have also seen this gap in the market. Over the last few days I have spent some time talking to Andrew Yates, CEO of Artesian and Christian Koestler, CEO of Lixto about their solutions in this area.

Artesian are are focused on monitoring media news, competitors intelligence and market intelligence that can be fed into front-office processes - in particular to sales force automation. Integration with SalesForce.com is provided as is delivery to mobile devices for mobile salespeople on the road. Their intention on media intelligence for example is to track coverage across all media channels contextually matched to commercial triggers or specific areas of interest.  What I like about Artesian is the fact that they have looked at how to drive revenue from intelligence derived from web content by plugging it into front-office processes. Also by adopting social software attached to front-office systems like SalesForce.com's new Chatter offering it becomes possible to collaborate over this intelligence. I would like to see this solution integrate with Microsoft SharePoint and IBM Lotus Connections for more use in large enterprises. However, seeing the need to focus attention on content that has real value in the front office is a real strength of this young startup.  

Lixto has a integrated development environment that allows you to build analytic applications pulling data from web sites such as competitor price information, new competitor marketing campaign data and other information that can be loaded into their customisable analytic applications to monitor competitors for example. 

Extracting insight from external data is definately on the increase with YellowBrix and Mark Logic also in on the act. IBM jumped into the market back in October with their announcement of IBM Cognos Content Analytics. This market is heating up. It seems to me that the start-ups are out there with competitive offerings.

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/11/external_data_feeds_bi_to_the.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/11/external_data_feeds_bi_to_the.php Fri, 27 Nov 2009 10:16:09 -0700
Data Federation - DW Patterns - Virtual Data Mart

Following on from my last blog on data federation, the next data federation pattern I would like to discuss is a Data Warehouse Virtual Data Mart pattern. This is as follows:

Pattern Description

This pattern uses data virtualization to create one or more virtual data marts on top of a BI system thereby providing multiple summarised views of detailed historical data in a data warehouse. Different groups of users can then run ad hoc reports and analyses on these virtual data marts without interfering with each others' analytical activity.

 

Pattern Diagram

 

Virtual DM Pattern.JPG

 

Pattern Example Use Case

Multiple 'power user' business analysts in the risk management department of a bank often need their own analytical environment to conduct specific in-depth analyses in order to create the best scoring and predictive models. This pattern facilitates the creation of multiple virtual data marts without the need to hold data in many different data stores

 

Reasons For Using It

Reduces the proliferation of data marts and also prevents inadvertent 'personal' ETL development by power users who have a tendency to want to extract their own data to create their own data marts. It is often the case that each power user wants a detailed subset of data from a data warehouse that overlaps with the data subsets required by other power users. This pattern avoids inadvertent inconsistent ETL processing on extracts of the same data by each and every power user. It also avoids the duplication of the same data in every data mart, improved power user business analyst productivity, reduces the time to create data marts and reduces the total cost of ownership.  

 

 

 

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/11/data_federation_-_dw_patterns_1.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/11/data_federation_-_dw_patterns_1.php Fri, 27 Nov 2009 07:01:32 -0700
Data Federation - DW Patterns - The Holistic Data View Pattern

Following on from my last blog on data federation, the next data federation pattern I would like to discuss is a Data Warehouse Holistic Data View pattern. This is as follows.

 

Pattern Description

This pattern, also known as the schema extension pattern, uses data virtualization to create a holistic complete view of business activity by combining the latest most up to date operational transactional activity in one or more operational systems with detailed corresponding historical data from data warehouses and data marts.

 

Pattern Diagram

 

 

Holistic Data View Pattern.JPG

 

Pattern Example Use Case

Front-office staff in a call centre operator or a branch of a bank may need to view current risk exposure for a customer they are on the phone to while also looking at a risk exposure trend for that customer across all loan products held. A second use case is regulatory compliance reporting whereby operational and historical data may both be needed for compliance reporting.  

 

Reasons For Using It

This pattern allows companies to quickly show a holistic view of business activity that includes the more recent transactional activity combined with historical activity. This data can be presented for analysis and reporting even if the latest transactional data has not yet reached the data warehouse. 

 

Look out for the next data federation data warehouse patterns on virtual data mart and virtual data source coming soon

 

 

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/11/data_federation_-_dw_patterns.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/11/data_federation_-_dw_patterns.php Mon, 16 Nov 2009 10:08:04 -0700
Informatica 9 Raises The Bar for Data Management Platforms

As you probably know, Informatica announced Informatica 9 yesterday in a blaze of publicity with, I am led to believe, over 10,000 people registered to view the announcement. So I thought I would make a few comments on what was announced.

The three main strands of the announcement were

·         Relevant data through business-IT collaboration

·         Trustworthy data through pervasive data quality

·         Timely data through open SOA-based services

 

Relevant data through business-IT collaboration includes new Browser-based analyst tools for analysts to directly specify their business requirements, automatic generation of implementation details from business specifications, and a common metadata repository allowing business analysts and IT developers to collaborate and share specification and implementation artifacts with each other

 

Pervasive data quality allows data quality rules to be specified once and reused repeatedly, ensuring consistency across applications. In addition, role-based tools are offered to allow stakeholders to take ownership of their own data quality requirements. Data quality scorecards, simple analyst tools and productive developer tools are also available to empower business users, business analysts, data stewards and IT developers to be directly involved in measuring and improving data quality.

 

SOA data services includes support for

·         Information catalog services to enable users to discover relevant data be it on-premise or in the internet cloud.

·         Logical data objects

·         Multi-modal data provisioning services to deliver data in a multiple formats using various protocols such as web services and SQL

·         Policy-based data services governance

 

In my opinion, differentials include policy-based data services governance (which is very unique) and the Business Analyst tools and collaboration support. The web based Business Analyst tools look a very compelling story although I would however have liked to see more integration with Microsoft and IBM Lotus collaborative tools and workspaces.

 

Data federation and consolidation on the same platform off same metadata with auto generation is a very strong capability. IBM has the same function but auto-generation in their case is out of two separate tools (InfoSphere Data Architect generates data federation logical objects and mappings while InfoSphere Fast-Track generates ETL jobs for Data Stage. Both IBM tools use common metadata however). I would have liked to have seen Informatica go the extra mile and auto generate XSLTs for XML message translation by ESBs/Message Broker products. I don't see this support but equally I don't see it anywhere else either as yet.  In addition I would have like to have seen MapReduce functionality in the announcement to handle Big Data integration. No doubt this is coming.

 

With respect to data services, I don't see ability to publish data services to an Enterprise Service Repository so that these services can be managed centrally in a common place with all other types of service although UDDI support was announced. Some competitors can publish services to ESRs, e.g. IBM with the InforSphere Services Director. Informatica's approach to Cloud Data integration also appears seamless but more information is needed. I understand a new announcement coming soon although they have already announced support for running PowerCenter on Amazon's EC Cloud.  In terms of competition, Microsoft can already run SSIS on SQL Azure cloud to integrate cloud data. In addition, IBM also has multi-modal support on InfoSphere Information Server beyond SQL and Web Services. They also support JAVA RMI, REST as well as SOAP, SQL and X/Query.

 

I would also have liked to see Informatica stick their neck out and acquire a data modelling tool rather than just integrate with everyone else's products. However, overall, this is a strong announcement with another Cloud announcement to come. There is no doubt that integrated Data Management platforms are here now with Informatica and IBM leading the way with e-Clipse based tool suites. SAP BusinessObjects and SAS DataFlux are clearly not far behind.  Expect more from Oracle and Microsoft in 2010.

Looking at the trend here, it is clear that companies need to look seriously at moving from separate data management tools from many different suppliers, each with its own metadata, to single platforms with integrated shared metadata

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/11/informatica_9_raises_the_bar_f.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/11/informatica_9_raises_the_bar_f.php Wed, 11 Nov 2009 08:26:35 -0700
Information Led Transformation - Will IBM's New Strategy Be A Success? So here I am in Las Vegas at IBM Information On-Demand - IBM's global information management conference. The up-coming theme that will be launched here is IBM's new Information Led Transformation (ILT) initiative which opens up IBM's major play in the Business Optimization market.  IBM is pouring enourmous amounts of money into this space, stating that this market is growing twice as fast as any other initiative including Business Automation. Their estimation on market size is $105Bn.  The objective of Information Led Transformation is micro-optimisation whereby every business optimization is carried out in real-time (or should I say right time) at all points of impact. That means optimising all decisions and process activities based on the current situation as it happens by leveraging event processing, predictive analytics, rules engines for automated action management based on a base of trusted information delived on-demand and in-context where it is needed and when it is needed. IBM ILT will leverage

  • IBM's InfoSphere Information Server platform,
  • InfoSphere Streams event processing,
  • Change data captue,
  • In-memory data in SolidDB and Cognos TM1
  • Cognos Performance Management and Analytics,
  • SPSS Predictive Analytics,
  • Automated decisions via iLog rules engine and other technologies such as WebSphere Business Events and Cognos Now!
  • Collaborative decision making via Lotus Connections
  • Process Optimisation using the WebSphere BPM technologies and ESB/message Broker.

On top of this IBM will deliver solutions (both crosss industry and vertical . We are entering an era of business automation to get business optimisation whereby BI is integrated into processes and event driven automated decision making and action taking keep the business running optimally at all points of operation.

In addition, ILT has 4000 IBM consultants already in place to chase business.  Time will tell how successful this initiative is. It is very ambitious but real-time use of intelligence and predictive analytics on an event-driven and on-demand basis is definately the right direction. The challenge here is bringing all these technologies together and getting IT groups to play ball. In addition many businesses need to learn how to optimise their business. Trusted data (via Enterprise Data Governance and MDM) will be fundamental to that as will the need for companies to make an inventory of their business events. Unless companies learn what to look for in different parts of their business they will not be able to maximise the benefits of business optimization. 

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/10/information_led_transformation.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/10/information_led_transformation.php Fri, 23 Oct 2009 15:32:57 -0700
Data Federation - The Data Discovery Pattern Following on from my last blog, the next data federation I would like to discuss is the Data Discovery pattern. This is as follows.

 

Pattern Description

This pattern uses data virtualization to query structured data held in multiple underlying core operational and analytical databases and file systems to answer business questions.  It uses a search like user interface that can return results as to where data associated with items being searched on can be found e.g. a search could be done on a customer name, an order and a sales representative name. The data discovery pattern allows users to query the virtual views of a data held in multiple systems via the data virtualization server. Through this mechanism users can find relationships between different data items across systems, view the data as if in a single system to discover answers to business questions.

 

Data Discovery Pattern.JPG

 

Pattern Example Use Case

Call centres are receiving a lot of enquiries as to why their orders are not being fulfilled. Data can be queried using a customer name, products ordered and the sales representative who took the order. Results returned show all occurrences of data about orders, the customer and the sales representative across multiple systems. Using the virtual views, this data can be analysed across systems to see what the reason is for delays in deliveries are e.g. order exceeds credit limit or order cannot be fulfilled due to inventory levels being too low. 

 

Reasons For Using It

This pattern has the affect of broadening access to enterprise data from a much larger user base who are confident in using a search box interface but who are not aware of where the data they need is located and who do not have the time and/or skills to use BI tools.

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/10/data_federation_-_the_data_dis.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/10/data_federation_-_the_data_dis.php Fri, 23 Oct 2009 12:31:21 -0700
Data Federation Patterns Having seen a lot of increase in demand from my clients to start a program to create information services, I thought it might be useful to look at one way of doing that through the use of data federation software. Then I realised that it would be better to look at all the popular ways of using this technology. On that basis, this blog starts a series of blogs from me on popular patterns that companies can use to get maximum value out of the data federation software.

In order to facilitate ease of understanding, the patterns discussed have been classified into the following categories

  • Business intelligence and performance management patterns
  • Data warehousing patterns
  • Master data patterns
  • Information services patterns
  • Operational patterns
  • Data management patterns

For those of you not sure what data federation is please refer to my 2006 article on the subject.

 

Performance Management Patterns

 Popular business intelligence (BI) and Performance Management patterns for data virtualization software are

  • The BI/Performance Management Integration pattern
  • The Data Discovery pattern

The BI/Performance Management Integration Pattern

This pattern uses data virtualization to integrate multiple underlying line of business (LoB) BI systems with performance management enterprise level scorecards and dashboards so as to allow detailed low level LoB metrics in the underlying BI systems to be used in calculating higher level enterprise key performance indicators in performance management scorecards and dashboards. This is essentially an aggregation pattern. There are two options associated with this pattern. The first is to map the data structures in multiple underlying BI system data stores to the virtual view(s) needed by performance management

 

Pattern Diagram (Option 1)

pattern1.JPG

 

The second is to map the virtual view(s) to underlying BI web services that will retrieve the necessary data from the BI systems as required. These BI web services will typically be BI tool reports and queries published as web services on the BI platform being used. The data virtualization server simply calls the appropriate BI tool(s) via a web service interface to run the report/query to get the data needed to calculate key performance indicators (KPIs) that appear in the performance management scorecard(s).

Pattern Diagram (Option 2)

Pattern 2.JPG  

Pattern Example Use Case

A manufacturer with different lines of business may want to monitor the total cost of shrinkage over all product lines to compare against targets.  A bank may have different BI systems monitoring risk exposure for each of its product lines (e.g. mortgages, credit cards, loans) and wants to monitor corporate exposure across all product lines to see if exposure is in line with targets.

Reasons For Using It

Many companies with multiple line of business (LoB) BI systems cannot answer enterprise level questions. This requires enterprise key performance indicators to be calculated by aggregating LoB metrics in multiple BI systems.

 

In my next blog we will look at the Data Discovery pattern. Click here for more information on Data Governance

 

 

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/10/data_federation_patterns.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/10/data_federation_patterns.php Wed, 07 Oct 2009 04:21:50 -0700
MDM and Cloud Computing Having read David Linthicum's blog on MDM and Cloud computing about the impact on data of applications moving off premise, I have to say that I couldn't agree more with him. What David is pointing out is that the fracturing of data caused by the adoption of cloud computing raises the importance of MDM in keeping disparate data synchronised.

This brings back memories of Business Process Outsourcing adoption several years back and what it did to companies that had no business process integration in place before they outsourced some process activities. The result of that strategy was that it fractured processes even more in many cases and sent some of the data outside the enterprise making it more difficult to get at. As applications go off premise there is a real danger MDM could get out of reach. It requires MDM to start to get implemented to get control over data. SalesForce.com data is already coming inside the enterprise via ETL tools into DWs. Several ETL vendors support this. I just don't think that there has been many bringing it back in to populate MDM. Siperian has some case studies of their MDM customer working with cloud applications - in particuar SalesForce.com. What it does say, is that pursuing a cloud computing strategy on external cloud based virtualized servers without a data governance strategy, could very well wreak havoc on any enterprise.

With virtualization being high on the agenda of many CIOs, I would suggest that they should also keep an eye on risk management and compliance otherwise they could well cause make it harder to achieve trusted data. Without MDM, a clound computing deployment strategy certainly puts an Enterprise Data Quality Firewall and data integration services high up the agenda priority list!

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/09/mdm_and_cloud_computing.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/09/mdm_and_cloud_computing.php Mon, 28 Sep 2009 13:07:10 -0700
Enterprise Data Governance - Cheers Arthur! First of all let me apologise to all my readers for not having blogged for a while. This year has turned out to be manic - crazily busy. I also confess to having become addicted to twitter - a "tweetaholic" where I have been micro-blogging. If you want to see my tweets you can do so here.  So I return to my blog the day after "Arthur Day" - 250 years ago yesterday a certain young Irishman named Arthur Guiness started a beer making company in Dublin. My topic today is that exciting topic of Enterprise Data Governance.  From research I did in a survey it was clear that many companies at the end of 2008 were not fully underway with Enterprise Data Governance in terms of getting their data under control and into a trusted, well managed state. Many had more to do in terms of organising themselves together with getting the necessary technology and processes in place to do this.  But the question I get asked the most is how do you know how well or poorly your company is governing its data? There are a few questions you can ask that will give you a good inkling.  These are as follows:

 

·         Do you know what data exists in your enterprise?

·         Do you have an inventory of data items in use?

·         How many names have you got for the same data item?

·         How many metrics with same name but with different formulae?

·         Do your Excel metrics formulae, DBMS metrics formulae, BI tool metrics formulae, ETL tool calculations, ... all agree?

 

If the answer is no to any of these questions, what chance do you stand of remaining compliant or of trusting your data? If you don't know how many different variations of a data item exist in your enterprise how can you govern your data?

 

Some other questions to ask here from a business perspective are:

·         How many times do your core processes break because of dirty data?

·         Have your company ever messed up an order and angered a customer because of dirty data?

·         In terms of compliance, do you trust your data enough to tell it to a judge?

 

In my opinion what companies actually need is an interactive data map so that you can press a button and see where your customer data is or where your order's data is.  In order to be able to do this you need to have common data definitions for your customer data attributes and for your order data attributes etc. In fact you need to have a common set of enterprise wide definitions for all core entity data, transaction data and metrics. 

 

Having established this, the next step is to discover where you data actually is. Therefore Data Discovery technology (albeit a new area in data management) is critical help do this. In fact I would go as far as to say that without data discovery technology it is very difficult to get data under control. Increasingly therefore we are seeing vendors acquire or build this kind of software.  Once you data is located you need to map disparate data definitions for the same data to common enterprise wide definitions to be able to see where data is. Physical column names, data models, BI tool semantic layers, reports, SPREADSHEETS, files, XML schema, Access databases... If you can't tie all these to the same corporate definitions how to you govern data?  This is where data dictionaries/ business glossaries are key.  Lineage matter.

Ultimately the objective is to get consistency across the enterprise. You have to unravel your spaghetti ball.  

This means you need to get organised correctly, get the right technologies in place and get the right processes in place for enterprise data governance. 

 

Master data is also part of the program. You need to find out where your master data is maintained. How is it synchronised? What screens on what applications are used to update it? Do you know? MDMis not as simple as it looks. It is often a multi-year investment. So ask yourself do you want to start with read-only or read write? If  you buy an MDM system and you start updating master data centrally, what happens if you are still updating it also in other applications? Companies need a well thought out strategy for MDM as part of the Data Governance program. I will be addressing this in further blogs in the near future.

 

For now though, let me salute my fellow Irish countryman. Arthur Guiness. If you are stressed out on Data Governance at the end of a hard week there is nothing like a decent pint to help you unwind. Cheers!

 

IMG_2224[1].JPG

 

 

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/09/enterprise_data_governance_-_c.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/09/enterprise_data_governance_-_c.php Fri, 25 Sep 2009 07:40:19 -0700
Data Federation - Rapid Information Delivery in a Tough Economy? Increasingly as I speak with my clients and CIOs I meet at various speaking engagements around UK and Europe,  it is becoming clear that data federation can potentially offer rapid value to IT budget constrained companies that just can't find the resources for another major database project. It may be that if you work in IT you are seeing increasing demand from business users for more reports requiring BI and non-BI information to help them manage their area of business responsibility in a more dynamic way.

In a recent paper by Jeremy Hope on Transforming Performance Management he states that "Most organizations want to adapt rapidly to changing events, but find that they are handicapped because of fixed budgets and poor forecasts. Adaptive organizations are able to respond more rapidly by switching resources dynamically to meet new threats and opportunities... ". 

In order to do this there is no doubt that companies have to deliver information more rapidly irrespective of whether or not the data is in a BI system. What many cannot afford is going through a formal time consuming process of data warehouse change to bring in all information necessary. Data federation is capable of sourcing data from several places one of which would of course have to be a data warehouse or data mart. But with increasing amounts of valuable information residing outside BI systems (especially on the internet) it seems that data federation has a role to play as a delivery platform rather than having to change data models, ETL processes and creating another cube or relational data mart. My expectation is that over the coming year we will see an increase in demend for data federation software. 

Several BI tools have been shipping data federation software as part of the BI tool bundle for some time to allow quick delivery of integrated information. Note that we are not talking about virtual data warehouses - on the contrary, data federation sofware rapidly integrates historical DW data with other data sources (operational data, internet feeds etc.) to deliver higher value information more rapidly. Therefore I see data federation software as complementary to BI systems.   If you would like more information on data federation please see this article on how it works. There is also a white paper on Maximizing Business Value from Data Virtualization  that talks about patterns and best practices to get the most out of this software. If you have alreay purchased data federation software or are considering it, let me know.  

  

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/04/data_federation_-_rapid_inform.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/04/data_federation_-_rapid_inform.php Wed, 01 Apr 2009 06:56:05 -0700
Data Governance losing priority in some European countries Having just got back from a presentation tour in mainland Europe, it  seems that in the countries I have spoken that Data Governance came out with a thumbs down vote among CIOs present in my sessions.  In particular in Belguim it would appear to be not on their radar.  Having probed for feedback into what exactly is high priority among CIOs attending my sessions it is almost as if raw 'survival' is taking hold. In other words, any IT project linked to business survival in this tough economic climate will get attention but not much else.  Customer retention, self- service, cost reduction/containment and growth are high on the list. One CIO explained to me that his companys' priority over the next 12 months was to allow customers to customise the products and services they offer much more in the future. Therefore in addition to offering their own product lines on the web, they would be integrating their  e-procurement with many back end e-suppliers so they can buy 'on-demand' to match what a customer wants. This means they want to allow customers to create their own custom 'package' before buying on-line and will stretch beyond their own products to stand out from the crowd.  It seems to me that data governance and data quality to some extent are taking a back seat in favour of investment that will keep the revenue rolling in. I would be interested in your feedback.  Is data governance a high priority in your organisation?    

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/03/data_governance_losing_priorit.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/03/data_governance_losing_priorit.php Mon, 30 Mar 2009 06:40:21 -0700
Flexible Fact Tables - Best Practice or Rope To Hang Business Users With? Blogging on the BeyeNETWORK occasionally offers up opportunity to open up a good debate. So here goes! Over the last several years I have observed data models in many different BI systems across different vertical industries where so called 'generic' fact tables have been designed with only one 'generic' measure. The objective of the design approach is that the measure in the fact table is supposed to hold ANY metric. Often this 'generic' measure column is then accompanied by some kind of type field to indicate what the measure actually is (what it means)  and some other attribute(s) to indicate the level(s) in various dimension hierarchies that the measure stored is associated with.  This helps indicate the additive nature of the metric. Also if it is a monetary measure it may have a currency field and if it is a unit measure it may have a field to explain the kind of units used e.g. centimeters, litres, cubic metres etc. The stated advantage of these kinds of approaches is flexibility. Adding new measures becomes easy to accommodate as no change to the design is necessary.  It is a perfectly good arguement and certainly appears widely practiced by designers.  

When it comes navigating such designs to develop queries (or even generate them) it is often the case that IT professionals developing reports for the business can figure out how to use retrieve the information required  (although even IT developers can struggle). However when it comes to business users developing their own ad hoc queries and reports I frequently see these users really struggling to navigate the 'flexible' design first trying to figure out what measures mean, if the measure(s) is/are additive and whatnot. More often than not I see this resulting in real frustration among business users who end up getting aggregations in reports wrong and then start to lose faith in their new BI system.  Of course IT steps in to rescue the situation by building more snapshot tables, more materialised views etc. burying generic 'complexity' to make the job easier for the user.  More often than not these users also often resort to switching back to Excel to hold data outside any data mart so that they can look at data in a form they understand.

Have you seen this in your organisation?  If so I want your feedback. Is it the case that so called  'flexible' design techniques are rope for end users to hang themselves with?  My question is this. What is the best way that you see to design fact tables so that business users become productive and can easily understand how to get at the data when building their own reports? I am not so sure that being so generic is of business value.  Sure it is flexible. But is it usable? What use is flexible design if a business user cannot understand it and make use of all that valuable data? Is it not better to have multiple metric attributes in a fact table (if multiple metrics are needed) with each attribute name saying what the measure actually is?  Let's have your input!    

]]>
http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/03/flexible_fact_tables_-_best_pr.php http://www.b-eye-network.co.uk/blogs/ferguson/archives/2009/03/flexible_fact_tables_-_best_pr.php Fri, 20 Mar 2009 02:34:55 -0700