IBM India Research Lab Challenges in Building a
® IBM India Research Lab Challenges in Building a Strategic Information Integration Infrastructure Mukesh Mohania IBM India Research Lab © 2006 IBM Corporation
IBM India Research Lab The Integration Challenge § Complex and heterogeneous environments 4 Many different types of systems 4 Many inter-related applications § Escalating needs 4 Variety, velocity, volume § People are expensive The world produces 250 MB of information every year for every man, woman and child on earth. 2
IBM India Research Lab The Challenge Continued… Only 1/3 rd of CFOs believe that the information is easy to use, tailored, cost effective or integrated. 60% + of CEOs: Need to do a better job capturing and understanding information rapidly in order to make swift business decisions. 85% of information is unstructure d. Customers Trx. 30 -50% of Employees Partners Products Orgs. Financials 42% of transactions are still paperbased. e-Mails Reports The average billion dollar company: 48 disparate financial systems Sources: IBM & Industry Studies, Customer Interviews, Forrester application design time is spent on copy management. Databases Web Content Media Documents 79% of companies have more than two repositories and 25% have more than 15 30% of people’s time: searching for relevant information. 40% of IT budgets may be spent on integration. 3
IBM India Research Lab Taikang Life Insurance Background § 4 th largest Chinese insurance company § 8, 000 employees, 150, 000 agents § 3. 5 million customers Business Challenge § 28 branches, 170 sub-branches § Data in DB 2 UDB, Informix, Oracle, SQL Server, XML, e-mail, CRM and Portal applications § Goals: 4 Up-to-the-minute status for executives 4 Increased employee productivity 4 Better customer service Technical Challenge 4
IBM India Research Lab Taikang Integrated Information Platform Architecture Channels Phone Fax SMS Email Web Store Front Mail Agents Financial Planner Application Platform XML SQL Web Services Data Service Information Integrated Integration Information Platform ODS cache Mapping (nicknames) Core Systems Informix Group & Banking DB 2/40 0 CSC Personal Life Oracle Financials 5
IBM India Research Lab Challenges in Integrating Information § Structured and unstructured data § Diversity of data sources (content repos, pricing application, databases, …) § Coming up with the model of how information fits together 4 Understanding what info exists 4 Finding related pieces 4 Creating a common format § Deciding how to access and transform data 4 What should be materialized, what accessed in real-time, how maintained 4 What pre-defined paths, what unplanned (navigation vs. search) § Configuring the appropriate software § Accessing information in the application § Monitoring the system and understanding usage, problems, etc 6
IBM India Research Lab Another perspective --- 7
IBM India Research Lab Virtualization: Grid Computing § Virtual, collaborative organizations sharing Virtual Servers, Storage and Instruments apps, data in open heterogeneous environment. § A potentially vast aggregation of geographically dispersed computing resources § Leverages Intranet, Extranet, and Internet implementations Grid Middleware Distributed Physical Servers and Storage § Lower TCO (Total Cost of Ownership) 8
IBM India Research Lab Data Virtualization for Information on the Grid § A Grid should allow information to be: 4 § 4 4 4 § Virtualized over Heterogeneous, Distributed Data Sources location & heterogeneity transparency Accessed via Open Protocols Autonomically administered Dynamic Putting information on the Grid enables: 4 Access to any data resource in a standard way 4 Viewing a collection of data resources as a single integrated entity 4 Placing data so as to exploit available processing/storage for performance and scale Lower TCO 9
IBM India Research Lab Distributed Data Management and Grid Computing Tasks Required Collaboration & data sharing Performance & Scalability Technologies Federation Consolidation Information Dissemination Replication Caching Parallelism Reduced Cost Autonomic Mixed Workload Mgmt Business Resiliency Replication Fast Backup & Recovery At Lower TCO (Total Cost of Ownership) Enhance Current Technologies Static & Manual Dynamic & Autonomic 10
IBM India Research Lab Data Virtualization: Grid Middleware for Integration & Qo. X Middleware masks dynamic nature of data sources, compute resources 11
IBM India Research Lab Distributed Data Management & Grid Computing Transparent, Optimized, Integrated Access to Heterogeneous Data at Lower TCO Discover & Leverage Resources • System • Information OGSA, Web Services for Qo. S Dynamic Federation Information Dissemination Data Placement Federated Query OGSA Data Replication Distributed Query Federation FTP, ETML Data Movement Data Driven Application Parallelism Services Oriented Architectures MPP Parallelism SMP Parallelism Monolithic Application Architectures Transaction Parallelism Open Standards Parallelism 12
- Slides: 12