19 Futures

CS 345 Spring 2009 Projects Avg 90.8, Stdev 7.5 20 18 16 Number of Students 14 12 10 8 6 4 2 0 60 65 70 75 80 85 ...

3 downloads 162 Views 2MB Size
CS 345 Spring 2009 Projects Avg 90.8, Stdev 7.5 20 18 16

Number of Students

14 12 10 8 6 4 2 0 60

65

70

75

80

85

90

95

100

Percentage Dr. Philip Cannata

1

Programming Languages Project Presentation –

Flite by Nirav Savjani and Vladimir Chernis

Possible Futures –



Parallel Computing •

Austin Mcgookey video



Orc demo by Nathan Wetzler

Propositional Induction •



Dr. Philip Cannata

ACL2 demo by Nathan Wetzler

Semantic Web

2

Parallel Computing Abstract: In December 2006 we published a broad survey of the issues for the whole field concerning the multicore/manycore sea change (see view.eecs.berkeley.edu). We view the ultimate goal as being able to productively create efficient, correct and portable software that smoothly scales when the number of cores per chip doubles biennially. This talk covers the specific research agenda that a large group of us at Berkeley are going to follow (see parlab.eecs.berkeley.edu) as part of a center funded for five years by Intel and Microsoft. To take a fresh approach to the longstanding parallel computing problem, our research agenda will be driven by compelling applications developed by domain experts in personal health, image retrieval, music, speech understanding and browsers. The development of parallel software is divided into two layers: an efficiency layer that aims at low overhead for 10 percent of the best programmers, and a productivity layer for the rest of the programming community-including domain experts-that reuses the parallel software developed at the efficiency layer. Key to this approach is a layer of libraries and programming frameworks centered around the 13 design patterns that we identified in the Berkeley View report. We rely on autotuning to map the software efficiently to a particular parallel computer. The role of the operating systems and the architecture in this project is to support software and applications in achieving the ultimate goal. Examples include primitives like thin hypervisors and libraries for the operating system and hardware support for partitioning and fast barrier synchronization. We will prototype the hardware of the future using field programmable gate arrays (FPGAs) on a common hardware platform being developed by a consortium of universities and companies (see http://ramp.eecs.berkeley.edu/ ).

Dr. Philip Cannata

3

Parallel Computing From http://www.macs.hw.ac.uk/~dsg/gph/docs/4.06/users_guide/concurrent-and-parallel.html

import Parallel nfib :: Int -> Int nfib n

| n <= 1 = 1 | otherwise = par n1 (seq n2 (n1 + n2 + 1)) where

n1 = nfib (n-1) n2 = nfib (n-2)

Dr. Philip Cannata

4

Parallel Computing The same code as the GPH fibonacci in ORC. def nfib(n) = if (n <= 1) then 1 else nfib(n-2) + n1
5

Semantic Web

Dr. Philip Cannata

6

Dr. Philip Cannata

7

Semantic Web - caBIG Abstract: 21st century biomedical research is driven by massive amounts of data: automated technologies generate hundreds of gigabytes of DNA sequence information, terabytes of high resolution medical images, and massive arrays of gene expression information on thousands of genes tested in hundreds of independent experiments. Clinical research data is no different: each clinical trial may potentially generate hundreds of data points of thousands of patients over the course of the trial. This influx of data has enabled a new understanding of disease on its fundamental, molecular basis. Many diseases are now understood as complex interactions between an individual's genes, environment and lifestyle. To harness this new understanding, research and clinical care capabilities (traditionally undertaken as isolated functions) must be bridged to seamlessly integrate laboratory data, biospecimens, medical images and other clinical data. This collaboration between researchers and clinicians will create a continuum between the bench and the bedside-speeding the delivery of new diagnostics and therapies, tailored to specific patients, ultimately improving clinical outcomes. To realize the promises of this new paradigm of personalized medicine, healthcare and drug discovery organizations must evolve their core processes and IT capabilities to enable broader interoperability among data resources, tools, and infrastructure-both within and across institutions. Answers to these challenges are enabled by the cancer Biomedical Informatics GridT (caBIGT) initiative, overseen by the National Cancer Institute Center for Biomedical Informatics and Information Technology (NCI-CBIIT). caBIGT is a collection of interoperable software tools, standards, databases, and grid-enabled computing infrastructure founded on four central principles: . Open access; anyone-with appropriate permission-may access caBIGT the tools and data . Open development; the entire research community participates in the development, testing, and validation of the tools . Open source; all the tools are available for use and modification . Federation; resources can be controlled locally, or integrated across multiple sites caBIGT is designed to connect researchers, clinicians, and patients across the continuum of biomedical research-allowing seamless data flow between electronic health records and data sources including genomic, proteomic, imaging, biospecimen, pathology and clinical information, facilitating collaboration across the entire biomedical enterprise.

Dr. Philip Cannata

caBIGT technologies are widely applicable beyond cancer and may be freely adopted, adapted or integrated with other standards-based tools and systems. Guidelines, tools and support infrastructure are in place to facilitate broad integration of caBIGT tools, which are currently being deployed at more than 60 academic medical centers around the United States and are being integrated in the Nationwide Health Information Network as well. For more information on caBIGT, visit http://cabig.cancer.gov/ 8

Semantic Web - caBIG

Dr. Philip Cannata

9

Semantic Web

Click here for PDF Dr. Philip Cannata

10

Semantic Web – RDF and SPARQL

Dr. Philip Cannata

11

Semantic Web – RDF

Dr. Philip Cannata

12

Semantic Web – Oracle RDF Example – Create Triples

-- John is the father of Suzie. INSERT INTO family_rdf_data VALUES (1, SDO_RDF_TRIPLE_S('family', 'http://www.example.org/family/John', 'http://www.example.org/family/fatherOf', 'http://www.example.org/family/Suzie'));

Click here to see all “Triples” – Appendix 1

-- John is the father of Matt. Like Prolog? INSERT INTO family_rdf_data VALUES (2, SDO_RDF_TRIPLE_S('family', 'http://www.example.org/family/John', 'http://www.example.org/family/fatherOf', 'http://www.example.org/family/Matt'));

Dr. Philip Cannata

13

Semantic Web – Oracle RDF Example – Create Rules

Rules: -- A father is male. INSERT INTO family_rdf_data VALUES (28, SDO_RDF_TRIPLE_S('family', 'http://www.example.org/family/fatherOf', 'http://www.w3.org/2000/01/rdf-schema#domain', 'http://www.example.org/family/Male'));

Click here to see all complete RDF database

INSERT INTO mdsys.rdfr_family_rb VALUES( Like Prolog? 'grandparent_rule', '(?x :parentOf ?y) (?y :parentOf ?z)', NULL, '(?x :grandParentOf ?z)', SDO_RDF_Aliases(SDO_RDF_Alias('','http://www.example.org/family/'))); Dr. Philip Cannata

14

Semantic Web – Oracle RDF Example – Query

-- Select all grandfathers and their grandchildren from the family model. -- Use inferencing from both the RDFS and family_rb rulebases. SELECT x grandParent, y grandChild FROM TABLE(SDO_RDF_MATCH( '(?x :grandParentOf ?y) (?x rdf:type :Male)', SDO_RDF_Models('family'), SDO_RDF_Rulebases('RDFS','family_rb'), SDO_RDF_Aliases(SDO_RDF_Alias('','http://www.example.org/family/')), null));

http://www.example.org/family/John http://www.example.org/family/Cindy http://www.example.org/family/John http://www.example.org/family/Tom http://www.example.org/family/John http://www.example.org/family/Cathy http://www.example.org/family/John

Like Prolog? Dr. Philip Cannata

15

Semantic Web – RDF

Namespace goes here

Like Prolog? Dr. Philip Cannata

16

Semantic Web – Namespaces or Vocabularies or Ontologies or Semantics

Click here for full namespace

Like Prolog?

Dr. Philip Cannata

17

Semantic Web – RDFS and OWL Namespaces or Vocabularies or Ontologies or Semantics

Dr. Philip Cannata

18

Semantic Web – RDF and RDFS Example

Like Prolog?

Dr. Philip Cannata

19

Semantic Web – RDFS and OWL Example

Like Prolog?

Dr. Philip Cannata

20

Semantic Web – SQL Type Query

Like Prolog?

Dr. Philip Cannata

21

Semantic Web – SPARQL

Like Prolog?

Dr. Philip Cannata

22

Good to know!

Probably in the (short?) future the standard query interface for applications will move from a standard "relational" view to a more complex but less theoretical "semantic" view: from SQL to SPARQL, with all the RDF/RDFS/OWL metadata of top of the raw data. And to follow the vision of a "semantic web", the applications and the data will not be bound together as we are used now, but will both live in the Internet cloud. Regards, Danilo. Bernard Horan ha scritto: (Taken from Oracle's publicly-available project list) Semantic Web Database Technologies Primarily in New England Development Center The vision of Tim Berners Lee, the father of the World Wide Web, is a Web that becomes continually smarter. It begins to understand the meaning of the content inside Web pages, both individually and in the aggregate. In essence, the Web becomes the repository and interpreter of knowledge. The so-called semantic Web will evolve over years and decades, and the process has just begun. Languages and technologies, such as RDF, RDFS, OWL, and SPARQL, have been introduced to standardize the vocabulary and representation of knowledge. The Oracle database is now imbued with the fi rst traces of this semantic Web vision. The database can be treated as a repository of knowledge, with native support for RDF triples, RDFS/OWL rules, user-defi ned rules, native inferencing, and querying with a SPARQL-like language embedded within SQL. An RDF knowledge store now inherits the scalability, robustness, and reliability of a database system. Additionally, enterprise databases will now accept requests that rely on the interpretation of ontologies to fully understand a query. This is just the beginning. We need to work on dynamic inferencing, security, data provenance, analytic tools, visualization of knowledge, better performance for all features, integration of external systems, and many more areas. By combining the power and capabilities of the Oracle database system with the most advanced semantic tools and technologies available elsewhere, we hope to create a system that hastens the advancement and acceptance of the semantic Web. For development of semantic technologies, we seek developers with familiarity in World Wide Web technologies, as well as an understanding of description logic, inferencing, and knowledge extraction, representation and visualization. Development takes place primarily in the New England Development Center. -Danilo Poccia Senior Systems Engineer Sun Microsystems Italia S.p.A. Via G. Romagnosi, 4 Roma 00196 ITALY Phone +39 06 36708 022 Mobile +39 335 6983999 Fax +39 06 3221969 Email [email protected] Blog http://blogs.sun.com/danilop

Dr. Philip Cannata

23

Dr. Philip Cannata

24

Semantic Web in Austin: http://www.semanticwebaustin.org/

http://juansequeda.blogspot.com/

Juan Sequeda, Ph.D Student Research Assistant Dept. of Computer Sciences The University of Texas at Austin http://www.cs.utexas.edu/~jsequeda [email protected] http://www.juansequeda.com/ Dr. Philip Cannata

25

http://www.w3.org/People/Ivan/CorePresentations/SWTutorial/Slides.pdf Dr. Philip Cannata

26

http://www.w3.org/2007/Talks/0202-Gijon-IH/Slides.pdf Dr. Philip Cannata

27

Potential Research Project WEB DATABASE (WDB): A JAVA SEMANTIC DATABASE by Bo Li A thesis submitted in partial fulfillment of the requirements for the degree of Bachelor of Science In Computer Sciences: Turing Scholars Honors University of Texas at Austin Spring, 2006

java

CLASS Person "Persons related to the company” ( person-id : INTEGER, REQUIRED; first-name : STRING, REQUIRED; last-name : STRING, REQUIRED; home_address : STRING; zipcode : INTEGER; home-phone "Home phone number (optional)" : INTEGER; us-citizen "U.S. citizenship status" : BOOLEAN, REQUIRED; spouse "Person's spouse if married" : Person, INVERSE IS spouse; children "Person's children (optional)" : Person, MV (DISTINCT), INVERSE IS parents; parents "Person's parents (optional)" : Person, MV (DISTINCT, MAX 2), INVERSE IS children; );

Dr. Philip Cannata

Complete and Expand

SI M

View Thesis

javacc

This will enable SIM and SPARQL queries against the data – possibly great for Semantic Data Integration

Replace this with RDF Triple Store 28