Object Web Architectures Portals P2P XML
ERDC Gateway Tutorial
Geoffrey Fox
IPCRES Laboratory for Grid Technology
Computer Science, Informatics, Physics
Indiana University, Bloomington IN
fox@csit.fsu.edu
9/16/2021 erdcgridportalaug 01 1
Computational Grids Survey
A brief introduction to computational grid projects and goals.
What Is a Computational Grid?
• Grids link distributed scientific resources.
  – Resources can be geographically and politically distributed.
• Goal: provide a means of sharing resources between organizations.
• Example “high-end” resources:
  – Supercomputers and clusters
  – Mass storage
  – Advanced visualization (CAVEs) and collaboration (Access Grid)
  – Particle colliders, telescopes, earthquake detectors
• www.globus.org/research/papers/anatomy.pdf
What Does a Grid Need?
• Multi-institutional security
  – PKI or Kerberos
• Information services
  – Manage, store, and deliver information about resources.
  – Use information to make decisions.
• Scheduling and queuing
  – Advance reservation
  – Meta-queuing
• Remote execution, file transfer, monitoring
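A minimal Python sketch of the "use information to make decisions" bullet: an information service supplies resource records and a scheduler picks one. The Resource record, its fields, and the site names are hypothetical and chosen only for illustration; no real grid information service is modeled here.

```python
from dataclasses import dataclass


@dataclass
class Resource:
    """One record as a (hypothetical) information service might report it."""
    name: str
    free_cpus: int
    queue_length: int


def choose_resource(resources, cpus_needed):
    """Return the eligible resource with the shortest queue, or None."""
    eligible = [r for r in resources if r.free_cpus >= cpus_needed]
    return min(eligible, key=lambda r: r.queue_length, default=None)


# Hypothetical snapshot of three sites.
sites = [
    Resource("site_a", free_cpus=64, queue_length=12),
    Resource("site_b", free_cpus=256, queue_length=3),
    Resource("site_c", free_cpus=512, queue_length=7),
]
choice = choose_resource(sites, cpus_needed=128)
```

Here the decision rule (shortest queue among sites with enough free CPUs) is only one possible policy; advance reservation or meta-queuing would need a richer record.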
Example of a Grid Problem: CERN’s Large Hadron Collider
• Goes on-line in 2005
• Will generate petabytes of raw, distributed data and terabytes of event summary data.
• Computing resources for data analysis will be distributed between CERN and regional centers spread all over the world.
• 1500-2000 people will collaborate on experiments.
Grid Projects
• Grid Infrastructure
  – Condor: www.cs.wisc.edu
  – Globus: www.globus.org
  – Legion: www.cs.virginia.edu/~legion
• Grid Applications
  – NetSolve: www.cs.utk.edu/netsolve
  – Ninf: www.etl.go.jp
• Global Grid Forum: www.gridforum.org
Examples of Deployed Grids
• NASA’s Information Power Grid
  – Links NASA’s Ames, Glenn, and Langley Centers.
  – LaunchPad currently available
  – www.ipg.nasa.gov
• DOE’s ASCI Distributed Resource Management
  – Links classified computing resources at Lawrence Livermore, Los Alamos, and Sandia National Labs.
  – Full deployment scheduled by Nov 2001.
Latest Grid News
• NSF will spend $53 million on the Distributed Terascale Facility (DTF)
  – 13.6 teraflops, 600 terabytes, 40 Gigabit/sec
  – DTF sites: NCSA, SDSC, Argonne, Caltech
  – Industry partners: IBM, Intel, Qwest
• See www.ncsa.uiuc.edu/News/Access/Releases for more information (August 9).
Distributed Objects
• Examples of current object technologies
  – Documents -- URL
  – "General Programs including database invocations"
    • Old Style Web -- CGI
    • New Style Web -- XML
    • CORBA and COM -- a special "interface definition language" (IDL) defines invocation in C++-like syntax
    • RMI uses the Java language as its IDL
• Benefits of distributed objects
  – Allow objects written in different languages to communicate seamlessly via standardized messaging protocols embodied by middleware
  – Higher levels of transparency of interoperability
  – Objects can be “self-managing” of resources
  – Provide a flexible grain of decomposition for building complex systems
Distributed Object Web Technology Model
• Basic Vision: Merge Web and Distributed Objects
• E.g. need to abstract entities (Web pages, database entries, simulations) and services as objects with methods (interfaces)
  – CORBA ... XML is “just” CGI done right
• COM (Microsoft) and CORBA (world) are competing cross-platform and cross-language object technologies
• JavaBeans plus RMI and perhaps JINI is a 100% pure Java distributed object technology
• W3C says you should use XML, which defines a better IDL, with Schema as an object specification model and SOAP as an object access model
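To make the idea of XML as an object access format concrete, here is a small Python sketch that marshals a method call into XML text and reads it back. It uses XML-RPC from the Python standard library as a stand-in; XML-RPC is a different (simpler) protocol than SOAP, and the method name solver.add and its arguments are hypothetical.

```python
import xmlrpc.client

# Marshal a call to a hypothetical remote method into XML text.
request_xml = xmlrpc.client.dumps((2.0, 3.0), methodname="solver.add")

# Any XML-aware peer, in any language, can recover the method name and
# arguments from the text alone; here we decode it with the same library.
params, method = xmlrpc.client.loads(request_xml)
```

The point of the sketch is that the invocation travels as plain, self-describing text, so client and server need to agree only on the message format, not on a language or object model.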
3-Tier Architecture and Different Object Models
• There are several important Object Models: COM, CORBA, Java, Web, Oracle Database ……
• But it doesn’t matter!!
[Figure: three-tier diagram. Labels: Object Repository; XML; File System (Web Site); Database; Request or Export/Import Information; Middle Tier “Business Logic” dissociates User and Back End]
Emerging Object Web Multi-Server Model
[Figure: multi-server model. Labels: Clients and their servers; Middle Tier Custom Servers; Back End Servers and their services]
Computational Science Grid: Multi-Server Web Computing System
• Portals are user interfaces to a Grid
• The World Wide Web is a big Grid
• P2P Networks include Grids
[Figure: multi-server grid diagram. Labels: Portal; Multidisciplinary Control (WebFlow); NEOS Control Optimization; Control; Optimization Service; Agent-based Choice of Compute Engine; Parallel DB Proxy; Database; Origin 2000 Proxy; MPP; NetSolve Linear Alg. Server; Matrix Solver; IBM SP2 Proxy; Data Analysis Server; The Grid]
Global Grid Forum
Computational Grids
• Exploit the analogy with electricity
  – Make using a computer as natural as plugging an appliance (PDA, PC) into a wall socket
• Make the ensemble of computers, storage devices, and scientific instruments on the web “seamlessly accessible”
• Link components of the grid together to solve a single problem
  – Clusters, metacomputers
• There are computational grids, education grids, information grids, shopping grids, etc.
  – The web is an (information) grid
• Everything is an object
• Generic access implies standards for APIs, protocols, and services
• USC (ISI, Carl Kesselman) and Argonne (Ian Foster) pioneered grids
Issues for Grids and hence Portals
• Are the grid components pretty much fixed, such as giant ASCI supercomputers?
• Or are they fleeting and mobile, such as Internet-connected cell phones?
  – The set of IP-enabled home sensors, appliances and controllers is a grid
• What are the requirements?
  – Security, anonymity, performance, ease of use …
• Different components and requirements imply that there is not likely to be just one grid but a federation of interoperable grids
• What are the “standards” and who sets them?
• How do universities build grids they care about on graduate time while industry builds and abandons remarkable technologies on Internet time?
Foster’s Grid architecture
• What is the difference between a protocol (SOAP, HTTP) and an application interface (HTML, MIME)?
ASCI Grid
• Link the multi-teraflop computers of ASCI together
  – Today 12, 3 and 2 teraflops; by 2005, 100, 60 and 20 teraflops
IPG Architecture
Information Power Grid
Led by NASA Ames
Experimental Particle Physics Grid
Earthquake Engineering Grid
• Links Experimental Facilities, Compute resources,
Commodity Portals are Web Interfaces for Consumers
Yahoo, NetCenter, Amazon.com, eBay.com etc. are portals for e-commerce, news etc.
We want to use these ideas in building computer interfaces
Hierarchy of Portals and Their Technology
• Portal Building Tools and Frameworks (XML, iPlanet, Portlets, www.desktop.com)
  – User customization, component libraries, fixed channels
[Figure: portal hierarchy. Labels: Generic Portals; Generic Services (Collaboration, Universal Access, Security, …); Information Services (Databases, …); Enterprise Portals; Education and Training Portals; Education Services (Quizzes, Grading, ...); K-12; University; Science Portals; Grid Services; Compute Services; Visualization; MathML etc; Biology; Chem Egy]
Services in Any Grid Application
• Security
• Fault Tolerance
• Object Lookup and Registration
• Object Persistence and Database support
• Event and Transaction Services
• Information Services
• Collaboration among users
  – Teachers and Students (Centra)
  – Market lead and Salespeople (WebEx)
Further Services in Computational Grids
• Job Status
• File Services (as in NPACI Storage Resource Broker)
  – Support (XML-based) computational-science-specific metadata like MathML, XSIL
• Visualization
• Programming, Debugging, Performance Monitoring
• Application Integration (chaining services viewed as backend compute filters); this can be called Workflow
• “Seamless Access” and integration of resources between different users/application domains
• Job Scheduling (Condor) and special operating modes such as a multitude of parameter search jobs
• Parameter Specification Service (get data from a Web form into a Fortran program wrapped as a backend object)
• High Performance for general services
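One way to picture the Parameter Specification Service is as a translator from web-form fields to an input file the Fortran program already understands. The Python sketch below renders form fields as a Fortran namelist group; the choice of namelist input, the group name run, and the field names are assumptions for illustration, not something the tutorial specifies.

```python
def to_namelist(group, form_fields):
    """Render web-form fields (all strings) as a Fortran namelist group."""
    lines = [f"&{group}"]
    for name, raw in form_fields.items():
        try:
            value = str(int(raw))            # integer field
        except ValueError:
            try:
                value = repr(float(raw))     # real-valued field
            except ValueError:
                # Anything else becomes a quoted character string;
                # embedded quotes are doubled, as Fortran expects.
                value = "'" + raw.replace("'", "''") + "'"
        lines.append(f"  {name} = {value}")
    lines.append("/")
    return "\n".join(lines)


# Hypothetical form submission.
form = {"nsteps": "100", "dt": "0.01", "label": "test"}
namelist_text = to_namelist("run", form)
```

A real service would also validate ranges and units before handing the text to the wrapped backend object.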
Web Computing and P2P
• Pleasingly (embarrassingly) parallel applications involve the management of multiple jobs running on separate, largely independent parts
• Some Monte Carlo calculations and parameter searches
  – Also fancy number-theoretic applications such as cracking of RSA security
• Here we see “use of idle cycles” and similar job scheduling issues
• Many have noticed the value of the Web for this; it is sometimes called P2P or peer-to-peer computing, as it involves peers on the edge of the Internet, not monster servers in the middle
• Note the total power of the Web is around one thousand times that of the most powerful supercomputer, but how much can be harnessed?
P2P for Distributed Computing or Web Computing I
• The P2P applications are highlighted by the use of millions of Internet clients to analyze data looking for extraterrestrial life (SETI@home, http://setiathome.ssl.berkeley.edu/) and the newer project examining the folding of proteins (Folding@home, http://www.stanford.edu/group/pandegroup/Cosm/).
• These are building distributed computing solutions for a special class of pleasingly or embarrassingly parallel applications:
  – Those that can be divided into a huge number of essentially independent computations, with a central server system doling out separate work chunks to each participating client.
• This approach is called P2P because the computing is peer based, even though it does not have the "peer-only communication" characteristic of P2P information systems like Gnutella and Napster.
• SETI@home and Folding@home are elegantly implemented as screen savers that you download.
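The "central server doles out work chunks" pattern can be sketched in a few lines of Python. This is a single-process simulation under stated assumptions: the sum-of-squares workload, the round-robin hand-out, and all function names are hypothetical stand-ins, not how SETI@home or Folding@home are actually implemented.

```python
from collections import deque


def make_chunks(n_items, chunk_size):
    """Split the index range [0, n_items) into independent work chunks."""
    return deque(
        range(start, min(start + chunk_size, n_items))
        for start in range(0, n_items, chunk_size)
    )


def client_compute(chunk):
    """Stand-in for the real analysis: sum of squares over one chunk."""
    return sum(i * i for i in chunk)


def run_server(n_items, chunk_size, n_clients):
    """Hand chunks to clients round-robin and combine their results.

    Clients only ever talk to the server, never to each other.
    """
    pending = make_chunks(n_items, chunk_size)
    chunks_per_client = [0] * n_clients
    total = 0
    client = 0
    while pending:
        chunk = pending.popleft()
        total += client_compute(chunk)      # result returned to the server
        chunks_per_client[client] += 1
        client = (client + 1) % n_clients
    return total, chunks_per_client


total, chunks_per_client = run_server(n_items=100, chunk_size=10, n_clients=3)
```

Because no chunk depends on another, clients can join, leave, or run at different speeds without changing the answer; a real server would additionally reissue chunks that are never returned.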
Parabon
• Pure Java model
• Ensures Security
Entropia Financial Modeling I
Entropia Financial Modeling II
• Each basic financial instrument can be calculated independently
• Central Server interprets the total simulation
• Make Money or Learn what causes market swings or ….
Drug Structure Simulations
United Devices also does Drug Simulation
• Parameter Study: do billions of simulations, each with different parameters
• Search-engine-like interface to simulation
• Works because each calculation fits in a PC; a detailed molecular model would usually not fit
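A parameter study of this kind reduces to evaluating one small function over a grid of inputs and then querying the results. The Python sketch below shows the shape of it; the score function and the temperature and pH values are hypothetical placeholders, not a real drug simulation.

```python
from itertools import product


def score(temperature, ph):
    """Hypothetical stand-in for one simulation; lower is better."""
    return (temperature - 310) ** 2 + (ph - 7.4) ** 2


temperatures = [290, 300, 310, 320]
ph_values = [6.8, 7.0, 7.4, 7.8]

# Every (temperature, ph) pair is an independent job small enough for one PC.
results = {params: score(*params) for params in product(temperatures, ph_values)}

# "Search engine" style query over the collected results.
best = min(results, key=results.get)
```

The dictionary comprehension is the part that would be farmed out to many clients; the final query runs centrally once the results are back.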
Performance of Entropia Network
P2P for Distributed Computing or Web Computing II
• Other projects of this type include:
• United Devices (http://www.ud.com/home.htm, based on SETI@home),
• AppliedMeta (http://www.appliedmeta.com, based on the well-known Legion project from the University of Virginia),
• Parabon Computation (http://www.parabon.com),
• Condor (from Wisconsin, http://www.cs.wisc.edu/condor/) and
• Entropia (http://www.entropia.com/).
• Other applications for this type of system include financial modeling, bio-informatics, measurement of web server performance and the scheduling of different jobs to use idle time on a network of workstations.
• Ian Foster has given a more detailed review of these activities at http://www.nature.com/nature/webmatters/grid.html and related them to computational grids (http://www.gridforum.org).
Learning Management Grid from DoD ADL
ADL = Advanced Distributed Learning
www.adlnet.org
[Figure: ADL architecture diagram. Labels: Learning Content Server(s); “Learning Management System” (LMS) Server; External systems: HR, E-Commerce, ERP ...; Migration Adapter; Course Interchange: Course Structure Format (CSF), Metadata; Services or Adapter; Learning Server Adapter; Server Side; Client; Browser; API Adapter; Application; HTML+; Common Grid Services & Objects; Runtime Environment: Launch, API, Data Model]
Properties of Educational Objects
• Metadata from IEEE and IMS
  – Roughly, properties of educational objects thought of as “documents” (author, title …)
  – Callout: All Grids
• Course Packaging from ADL and IMS
  – How to form bigger (educational) objects from smaller objects
• Enterprise Properties from IMS
  – Link to people (users) and organization databases (rather incomplete at present but must be important as probably can agree)
• Tests and Quizzes from IMS
• Specialized descriptors from ADL
  – Such as objectives, prerequisites, completion requirements
Education Specific Portal Services
• Administrative Structure
  – Degrees, departments, lecturers, Deans ...
• Performance (grading) information
• Homework submission
• Quizzes of various types (multiple choice, random parameters)
• Assessment data and analysis
• Hierarchical Curriculum structure from document fragment to page to lecture to course
• Napster/Gnutella type P2P distributed information system with personalized dynamic collections (analogy between CDROM of pirated music and dynamic lectures/personal info resource as in RealJukebox)
Some Science Portals and Services: Gannon Project
[Table: portals (rows) against Supported Grid Services (columns JS JM IS FM AA CT SC EJ), Grid Standards used or will use, and Needed Services]
Service columns:
  JS Job Submission
  JM Job Management, e.g. File Staging
  IS Information Services
  FM File Management
  AA Authorization and Accounting
  CT Composition
  SC Scripting
  EJ Job Journaling
Portals: Gateway, Mississippi, Unicore, Hot Page, Indiana, Nimrod, Cactus
Grid standards and needed services listed: Kerb/GSI, MDS; Events, Collab, app; Gram, CORBA; resource management; EJB; Kerb/GSI, Gram, Event; GIS, CORBA; Data access services; GSI, GIS; Abstract Job; Scheduling; Metadata; Grid Accounting; GSI, GIS, Gram; portal-to-portal protocols; SRB; cert/key repository; GSI, MDS, Gram; Events, app schema; GSIFTP, CoG; standards, RMI; Globus, Legion; Resource Auctions &; Condor; Globus via GPDK; Allocations; Resource Brokers
Some Science Portals and Services: Gannon Project
[Table: portals (rows) against Supported Grid Services (columns JS JM IS FM AA CT SC EJ), Grid Standards used or will use, and Needed Services]
Service columns:
  JS Job Submission
  JM Job Management, e.g. File Staging
  IS Information Services
  FM File Management
  AA Authorization and Accounting
  CT Composition
  SC Scripting
  EJ Job Journaling
Portals: GPDK-LBL, CoG-ANL, JiPANG/Ninf, ECCE+ELN, IPG LaunchPad, Lattice, Discover
Grid standards and needed services listed: Globus via CoG; Globus, CORBA; Software Installation; CoG, Jini, Ninf, Events; Netsolve; GSI, GIS; Events; Meta-scheduling; GSI, GIS via; Events; GPDK; OpenSSL, X.509; CORBA