SMART System Model Acquisition from Requirements Text BPM

SMART System Model Acquisition from Requirements Text BPM 2004 Potsdam, Germany, June 17 -18, 2004 Dov Dori Nahum Korda Avi Soffer Shalom Cohen Technion – Israel Institute of Technology SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

Introduction Freely expressed ideas, Concepts, and intentions Formal System Specification System Inception • Transformation of free-format business and user requirements into a formal system specification is a complex and laborious operation. • The clutter of details prevents seeing ‘the big picture’ and focusing on the system goals SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

SMART Processing • The challenge: convert free natural language requirements text into a formal, machine-processable model • The SMART idea: acquire the system model through an automated process based on extracting semantics from free-format text Freely expressed ideas, Concepts, and intentions SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

Two required SMART technologies: 1) Information Extraction Technology, capable of: § Identifying elements that are key concepts for the domain and problem at hand § Deriving a semi-formalized representation of the underlying documentation. 2) A system modeling environment, capable of: § Human-oriented intuitive expression of complex system structure and behavior § Formalism that allows machine processing SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

What is OPM Object-Process Methodology? A comprehensive paradigm for – modeling – engineering – lifecycle support of complex, multi-disciplinary systems SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

OPM’s Building Blocks are Things: Objects and Processes Object Process SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

Entities: Objects, Processes, States • Objects and processes are two • • types of equally important things (entities) required to describe a system in a single, unifying model At any point in time, each object is at some state Object states are transformed through the occurrence of a process SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

A process changes an object’s state SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

A process generates a new object SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

OPM has a single model with a bimodal representation A single diagram type: Object-Process Diagram (OPD) A corresponding subset of language: Object-Process Language (OPL) SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

Resources: OPM book Dov Dori Object-Process Methodology - A Holistic Systems Paradigm, Springer Verlag, Berlin, Heidelberg, New York, 2002 SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

Resources: OPM Website www. Object. Process. org Free OPCAT Publications SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology downloads

The SMART System Framework OPD Input OPL Output System Architecting Team handles System Model Acquisition requires System Requirements In Natural Language. System Model Acquisition yields System Model. SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

OPCAT – Object-Process CASE Tool Tree OPD OPL SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

SMART - System Diagram SMART Categorization Engine System Requirements Unstructured Text OPL Generator System Model Acquisition OPCAT System Architecting Team System Model SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

System Model Acquisition Process In-zoomed SMART System Model Acquisition System Requirements Unstructured Text Category Extraction Categorization Engine Category List raw System Architecting Team edited List Editing Relation Set Relation Formulating OPL Generator OPL Sentence Generating OPL Sentence Set OPCAT OPD Constructing System Model SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

SMART – Procedural Steps 1. Automatic Extraction of Categories from Unstructured Text 2. Manual Manipulation of Categories 3. Automatic Search of OPM Relations 4. Automatic Generation of OPL Sentences 5. Manual Enhancement of the Results Free-Format Text SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

1. Automatic Extraction of Categories from Unstructured Text • Categories = idiomatic phrases (word sequences) – Reflect the underlying topics in a given corpus of documents • Categorization engine based on heuristics – Implemented in Common LISP • Can combine external – Ontologies – Taxonomies – Thesauri SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

2. Manual Manipulation of Categories • Selecting categories that can be things in the OPM model • Classifying them as either objects or processes • Clustering of alternative formulations for the selected OPM things based on their semantic similarity • Option for manually adding OPM things that did not show up among the extracted categories SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

3. Automatic Search of OPM Relations • Utilizes a set of configurable, predefined templates: – Template consists of two things and the relation between them, expressed in alternative ways – Utilizes second order regular expressions defined on any lexical or grammatical attribute (part‑of‑speech, capitalization, punctuation) • Finite‑state automaton that operates on a suffix‑tree index consisting of tokens • Word-based: Instead of comparing character strings, compares word sequences SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

4. Automatic Generation of OPL Sentences • Every extracted natural language sentence is s translated into OPL • Reformulation of outcome to better reflect the underlying relations: – Custom relations transformed into processes (cached into => Caching) – Complex relations transformed into two equivalent simple sentences: Actual Documents cached into Document Repositories. => • Caching requires Actual Documents. • Caching yields Document Repositories. • Transformations maintain original underlying semantics of the NL sentence SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

5. Manual Enhancement of the Results • Non-semantic corrections – extraction did not depict all of the existing or implied relations • Additions and eliminations - semantically modify original output • Scaling applied to simplify results without losing details SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

The SMART Experiment • Demonstrates the feasibility of automating the most critical step in the system engineering process. • Based on GRACE (Grid Retrieval And Categorization Engine), a European Community Information Society Technology (IST) project. • Designed as proof-of-concept, offering hands-on experience required for the development of a future full-scale industrial application. SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

SMART Benefits • Reduces the initial level of conceptual complexity when starting to build a system • Significantly reduces the quantity of material that needs to be processed manually • Graphic manipulation (in OPD) much easier than text editing • Quality, accuracy, and conciseness of the system architecture is higher due to the discipline OPM introduces at an early stage • Capable of automatic generation of UML diagrams SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology

The Future of SMART • Component in 6 th Framework EEC IST-202507126 COCOON (Building Knowledge-driven and Dynamically Networked Communities within European Healthcare Systems) • Further R&D: more sophisticated extraction templates, increased level of automation, improved performance • Commercial pilot application planned SMART - System Model Acquisition from Requirements Text Technion – Israel Institute of Technology
- Slides: 25