Blog: Mike Ferguson Subscribe to this blog's RSS feed!

Mike Ferguson

Welcome to my blog on the UK Business Intelligence Network. I hope to help you stay in touch with hot topics and reality on the ground in the UK and European business intelligence markets and to provide content, opinion and expertise on business intelligence (BI) and its related technologies. I also would relish it if you too can share your own valuable experiences. Let's hear what's going on in BI in the UK.

About the author >

Mike Ferguson is Managing Director of Intelligent Business Strategies Limited, a leading information technology analyst and consulting company. As lead analyst and consultant, he specializes in enterprise business intelligence, enterprise business integration, and enterprise portals. He can be contacted at +44 1625 520700 or via e-mail at mferguson@intelligentbusiness.biz.

Following on from my last blog on data federation, the next data federation pattern I would like to discuss is a On-Demand Information Services Pattern. This is as follows:

 

Pattern Description

This pattern uses data virtualization to provide on-demand integrated data to applications, reporting tools, processes and portals via a web services user interface. Structured and semi-structured data sources are supported including RDBMS, any web service (internal or external), web syndication feeds, flat files, XML, packaged applications and non-relational databases.

 

Pattern Diagram

Blog-TheOnDemandInformationServicesPattern.JPG

 

Pattern Example Use Case

A company needs to different kinds of information services targeted at different role-based user communities for access via their enterprise portal.  These services include:

 

·         Internal operational and analytical information services

·         Information services that integrate structured and semi-structured information including internal and external syndicated web feeds

·         Information as a Service (IaaS) services that  render information in various XML formats (e.g. XBRL) for consumption by external users and applications

 

Reasons For Using It

Rapid development of re-usable information services for consumption by portals, composite applications, processes and reporting tools.

 


Posted December 18, 2009 3:34 AM
Permalink | No Comments |

Following on from my last blog on data federation, the next data federation pattern I would like to discuss is a Master Data Virtual MDM pattern. This is as follows:

 

Pattern Description

This pattern uses data virtualization to provide one or more on-demand integrated views of master data entities such as customer, product, asset, employee etc. even though the master data is fractured across multiple underlying systems. Applications, processes, portals, reporting tools and data integration workflows needing master data can acquire it on-demand via a web service interface or via a query interface such as SQL.

 

Pattern Diagram

Blog-TheVirtualMasterDataManagementPattern.JPG

Pattern Example Use Case

A manufacturer needs to make sure that changes to its customer data are made available to its marketing, e-commerce, finance and distribution systems as well as its business intelligence systems to keep business operations, reporting and analysis running smoothly. A shipping group of companies needs to perform a routine maintenance upgrade on a particular type of asset. However, its assets are managed by different systems in multiple lines of business. In order to budget for this upgrade it needs to have a single view of assets to fully understand maintenance costs. 

 

Reasons For Using It

To obtain a single integrated views of master data for consistency across business operations quickly at a relatively low cost.


Posted December 11, 2009 9:24 AM
Permalink | No Comments |

Following on from my last blog on data federation, the next data federation pattern I would like to discuss is a Data Warehouse Virtual Data Source pattern. This is as follows:

Pattern Description

This pattern uses virtual views of federated data to create virtual data source components for use in ETL processing. The purpose of this pattern is twofold. Firstly to protect ETL workflows from structural changes to operational data sources. Secondly to create re-usable virtual data source 'components' for accessing disintegrated master and transactional data. The virtual data source pattern effectively 'ring fences' just the data associated with a customer, or a product for example, meaning that ETL workflows can be built for customer data, product data, asset data, order data etc.  This helps ETL designers to create ETL jobs dedicated to a particular type of data e.g. the customer ETL job, the product ETL job, the orders ETL job. Simplistic design of data consolidation workflows dedicated to a type of data allows these jobs to be re-used if the same data is needed elsewhere, e.g. customer data needed in two different data marts. It also guarantees that the same data is made available again and again via the same virtual data source  

 

Pattern Diagram

Blog-TheVirtualDataSourcePattern.JPG

Pattern Example Use Case

Merger and acquisitions and new system releases often cause changes to operational systems data structures. This pattern can be used to shield ETL jobs that populate data warehouses and master data hubs from structural changes to source systems simply by changing the mappings in the virtual source views.

 

Reasons For Using It

Reasons for using this pattern include the ability to manage change more easily, lower ETL development and maintenance costs and modular design of data integration workflows associated with consolidating data.


Posted December 4, 2009 3:51 AM
Permalink | No Comments |

Everywhere I look at the moment I see my clients talking about needing to benchmark themselves against the market, to understand customer and prospect sentiment on social networking sites and to understand competitors in much more detail. It is not just me that has recognised this need. It also seems that new young startup companies have also seen this gap in the market. Over the last few days I have spent some time talking to Andrew Yates, CEO of Artesian and Christian Koestler, CEO of Lixto about their solutions in this area.

Artesian are are focused on monitoring media news, competitors intelligence and market intelligence that can be fed into front-office processes - in particular to sales force automation. Integration with SalesForce.com is provided as is delivery to mobile devices for mobile salespeople on the road. Their intention on media intelligence for example is to track coverage across all media channels contextually matched to commercial triggers or specific areas of interest.  What I like about Artesian is the fact that they have looked at how to drive revenue from intelligence derived from web content by plugging it into front-office processes. Also by adopting social software attached to front-office systems like SalesForce.com's new Chatter offering it becomes possible to collaborate over this intelligence. I would like to see this solution integrate with Microsoft SharePoint and IBM Lotus Connections for more use in large enterprises. However, seeing the need to focus attention on content that has real value in the front office is a real strength of this young startup.  

Lixto has a integrated development environment that allows you to build analytic applications pulling data from web sites such as competitor price information, new competitor marketing campaign data and other information that can be loaded into their customisable analytic applications to monitor competitors for example. 

Extracting insight from external data is definately on the increase with YellowBrix and Mark Logic also in on the act. IBM jumped into the market back in October with their announcement of IBM Cognos Content Analytics. This market is heating up. It seems to me that the start-ups are out there with competitive offerings.


Posted November 27, 2009 10:16 AM
Permalink | No Comments |

Following on from my last blog on data federation, the next data federation pattern I would like to discuss is a Data Warehouse Virtual Data Mart pattern. This is as follows:

Pattern Description

This pattern uses data virtualization to create one or more virtual data marts on top of a BI system thereby providing multiple summarised views of detailed historical data in a data warehouse. Different groups of users can then run ad hoc reports and analyses on these virtual data marts without interfering with each others' analytical activity.

 

Pattern Diagram

 

Virtual DM Pattern.JPG

 

Pattern Example Use Case

Multiple 'power user' business analysts in the risk management department of a bank often need their own analytical environment to conduct specific in-depth analyses in order to create the best scoring and predictive models. This pattern facilitates the creation of multiple virtual data marts without the need to hold data in many different data stores

 

Reasons For Using It

Reduces the proliferation of data marts and also prevents inadvertent 'personal' ETL development by power users who have a tendency to want to extract their own data to create their own data marts. It is often the case that each power user wants a detailed subset of data from a data warehouse that overlaps with the data subsets required by other power users. This pattern avoids inadvertent inconsistent ETL processing on extracts of the same data by each and every power user. It also avoids the duplication of the same data in every data mart, improved power user business analyst productivity, reduces the time to create data marts and reduces the total cost of ownership.  

 

 

 


Posted November 27, 2009 7:01 AM
Permalink | No Comments |

Following on from my last blog on data federation, the next data federation pattern I would like to discuss is a Data Warehouse Holistic Data View pattern. This is as follows.

 

Pattern Description

This pattern, also known as the schema extension pattern, uses data virtualization to create a holistic complete view of business activity by combining the latest most up to date operational transactional activity in one or more operational systems with detailed corresponding historical data from data warehouses and data marts.

 

Pattern Diagram

 

 

Holistic Data View Pattern.JPG

 

Pattern Example Use Case

Front-office staff in a call centre operator or a branch of a bank may need to view current risk exposure for a customer they are on the phone to while also looking at a risk exposure trend for that customer across all loan products held. A second use case is regulatory compliance reporting whereby operational and historical data may both be needed for compliance reporting.  

 

Reasons For Using It

This pattern allows companies to quickly show a holistic view of business activity that includes the more recent transactional activity combined with historical activity. This data can be presented for analysis and reporting even if the latest transactional data has not yet reached the data warehouse. 

 

Look out for the next data federation data warehouse patterns on virtual data mart and virtual data source coming soon

 

 


Posted November 16, 2009 10:08 AM
Permalink | No Comments |

As you probably know, Informatica announced Informatica 9 yesterday in a blaze of publicity with, I am led to believe, over 10,000 people registered to view the announcement. So I thought I would make a few comments on what was announced.

The three main strands of the announcement were

·         Relevant data through business-IT collaboration

·         Trustworthy data through pervasive data quality

·         Timely data through open SOA-based services

 

Relevant data through business-IT collaboration includes new Browser-based analyst tools for analysts to directly specify their business requirements, automatic generation of implementation details from business specifications, and a common metadata repository allowing business analysts and IT developers to collaborate and share specification and implementation artifacts with each other

 

Pervasive data quality allows data quality rules to be specified once and reused repeatedly, ensuring consistency across applications. In addition, role-based tools are offered to allow stakeholders to take ownership of their own data quality requirements. Data quality scorecards, simple analyst tools and productive developer tools are also available to empower business users, business analysts, data stewards and IT developers to be directly involved in measuring and improving data quality.

 

SOA data services includes support for

·         Information catalog services to enable users to discover relevant data be it on-premise or in the internet cloud.

·         Logical data objects

·         Multi-modal data provisioning services to deliver data in a multiple formats using various protocols such as web services and SQL

·         Policy-based data services governance

 

In my opinion, differentials include policy-based data services governance (which is very unique) and the Business Analyst tools and collaboration support. The web based Business Analyst tools look a very compelling story although I would however have liked to see more integration with Microsoft and IBM Lotus collaborative tools and workspaces.

 

Data federation and consolidation on the same platform off same metadata with auto generation is a very strong capability. IBM has the same function but auto-generation in their case is out of two separate tools (InfoSphere Data Architect generates data federation logical objects and mappings while InfoSphere Fast-Track generates ETL jobs for Data Stage. Both IBM tools use common metadata however). I would have liked to have seen Informatica go the extra mile and auto generate XSLTs for XML message translation by ESBs/Message Broker products. I don't see this support but equally I don't see it anywhere else either as yet.  In addition I would have like to have seen MapReduce functionality in the announcement to handle Big Data integration. No doubt this is coming.

 

With respect to data services, I don't see ability to publish data services to an Enterprise Service Repository so that these services can be managed centrally in a common place with all other types of service although UDDI support was announced. Some competitors can publish services to ESRs, e.g. IBM with the InforSphere Services Director. Informatica's approach to Cloud Data integration also appears seamless but more information is needed. I understand a new announcement coming soon although they have already announced support for running PowerCenter on Amazon's EC Cloud.  In terms of competition, Microsoft can already run SSIS on SQL Azure cloud to integrate cloud data. In addition, IBM also has multi-modal support on InfoSphere Information Server beyond SQL and Web Services. They also support JAVA RMI, REST as well as SOAP, SQL and X/Query.

 

I would also have liked to see Informatica stick their neck out and acquire a data modelling tool rather than just integrate with everyone else's products. However, overall, this is a strong announcement with another Cloud announcement to come. There is no doubt that integrated Data Management platforms are here now with Informatica and IBM leading the way with e-Clipse based tool suites. SAP BusinessObjects and SAS DataFlux are clearly not far behind.  Expect more from Oracle and Microsoft in 2010.

Looking at the trend here, it is clear that companies need to look seriously at moving from separate data management tools from many different suppliers, each with its own metadata, to single platforms with integrated shared metadata


Posted November 11, 2009 8:26 AM
Permalink | 1 Comment |

So here I am in Las Vegas at IBM Information On-Demand - IBM's global information management conference. The up-coming theme that will be launched here is IBM's new Information Led Transformation (ILT) initiative which opens up IBM's major play in the Business Optimization market.  IBM is pouring enourmous amounts of money into this space, stating that this market is growing twice as fast as any other initiative including Business Automation. Their estimation on market size is $105Bn.  The objective of Information Led Transformation is micro-optimisation whereby every business optimization is carried out in real-time (or should I say right time) at all points of impact. That means optimising all decisions and process activities based on the current situation as it happens by leveraging event processing, predictive analytics, rules engines for automated action management based on a base of trusted information delived on-demand and in-context where it is needed and when it is needed. IBM ILT will leverage

  • IBM's InfoSphere Information Server platform,
  • InfoSphere Streams event processing,
  • Change data captue,
  • In-memory data in SolidDB and Cognos TM1
  • Cognos Performance Management and Analytics,
  • SPSS Predictive Analytics,
  • Automated decisions via iLog rules engine and other technologies such as WebSphere Business Events and Cognos Now!
  • Collaborative decision making via Lotus Connections
  • Process Optimisation using the WebSphere BPM technologies and ESB/message Broker.

On top of this IBM will deliver solutions (both crosss industry and vertical . We are entering an era of business automation to get business optimisation whereby BI is integrated into processes and event driven automated decision making and action taking keep the business running optimally at all points of operation.

In addition, ILT has 4000 IBM consultants already in place to chase business.  Time will tell how successful this initiative is. It is very ambitious but real-time use of intelligence and predictive analytics on an event-driven and on-demand basis is definately the right direction. The challenge here is bringing all these technologies together and getting IT groups to play ball. In addition many businesses need to learn how to optimise their business. Trusted data (via Enterprise Data Governance and MDM) will be fundamental to that as will the need for companies to make an inventory of their business events. Unless companies learn what to look for in different parts of their business they will not be able to maximise the benefits of business optimization. 


Posted October 23, 2009 3:32 PM
Permalink | No Comments |

Following on from my last blog, the next data federation I would like to discuss is the Data Discovery pattern. This is as follows.

 

Pattern Description

This pattern uses data virtualization to query structured data held in multiple underlying core operational and analytical databases and file systems to answer business questions.  It uses a search like user interface that can return results as to where data associated with items being searched on can be found e.g. a search could be done on a customer name, an order and a sales representative name. The data discovery pattern allows users to query the virtual views of a data held in multiple systems via the data virtualization server. Through this mechanism users can find relationships between different data items across systems, view the data as if in a single system to discover answers to business questions.

 

Data Discovery Pattern.JPG

 

Pattern Example Use Case

Call centres are receiving a lot of enquiries as to why their orders are not being fulfilled. Data can be queried using a customer name, products ordered and the sales representative who took the order. Results returned show all occurrences of data about orders, the customer and the sales representative across multiple systems. Using the virtual views, this data can be analysed across systems to see what the reason is for delays in deliveries are e.g. order exceeds credit limit or order cannot be fulfilled due to inventory levels being too low. 

 

Reasons For Using It

This pattern has the affect of broadening access to enterprise data from a much larger user base who are confident in using a search box interface but who are not aware of where the data they need is located and who do not have the time and/or skills to use BI tools.


Posted October 23, 2009 12:31 PM
Permalink | No Comments |

Having seen a lot of increase in demand from my clients to start a program to create information services, I thought it might be useful to look at one way of doing that through the use of data federation software. Then I realised that it would be better to look at all the popular ways of using this technology. On that basis, this blog starts a series of blogs from me on popular patterns that companies can use to get maximum value out of the data federation software.

In order to facilitate ease of understanding, the patterns discussed have been classified into the following categories

  • Business intelligence and performance management patterns
  • Data warehousing patterns
  • Master data patterns
  • Information services patterns
  • Operational patterns
  • Data management patterns

For those of you not sure what data federation is please refer to my 2006 article on the subject.

 

Performance Management Patterns

 Popular business intelligence (BI) and Performance Management patterns for data virtualization software are

  • The BI/Performance Management Integration pattern
  • The Data Discovery pattern

The BI/Performance Management Integration Pattern

This pattern uses data virtualization to integrate multiple underlying line of business (LoB) BI systems with performance management enterprise level scorecards and dashboards so as to allow detailed low level LoB metrics in the underlying BI systems to be used in calculating higher level enterprise key performance indicators in performance management scorecards and dashboards. This is essentially an aggregation pattern. There are two options associated with this pattern. The first is to map the data structures in multiple underlying BI system data stores to the virtual view(s) needed by performance management

 

Pattern Diagram (Option 1)

pattern1.JPG

 

The second is to map the virtual view(s) to underlying BI web services that will retrieve the necessary data from the BI systems as required. These BI web services will typically be BI tool reports and queries published as web services on the BI platform being used. The data virtualization server simply calls the appropriate BI tool(s) via a web service interface to run the report/query to get the data needed to calculate key performance indicators (KPIs) that appear in the performance management scorecard(s).

Pattern Diagram (Option 2)

Pattern 2.JPG  

Pattern Example Use Case

A manufacturer with different lines of business may want to monitor the total cost of shrinkage over all product lines to compare against targets.  A bank may have different BI systems monitoring risk exposure for each of its product lines (e.g. mortgages, credit cards, loans) and wants to monitor corporate exposure across all product lines to see if exposure is in line with targets.

Reasons For Using It

Many companies with multiple line of business (LoB) BI systems cannot answer enterprise level questions. This requires enterprise key performance indicators to be calculated by aggregating LoB metrics in multiple BI systems.

 

In my next blog we will look at the Data Discovery pattern. Click here for more information on Data Governance

 

 


Posted October 7, 2009 4:21 AM
Permalink | No Comments |