Multivariate Data Visualisation Benjamin RadburnSmith Manchester 050110 BENJAMIN
Multivariate Data Visualisation Benjamin Radburn-Smith Manchester 05/01/10 BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10
Outline Visualisation Techniques Parallel Coordinates Grand Tour My 2009 CSC 2009 RAL HEP Summer School 2009 My Program Future work To the program Physics BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 0
Conventional Visualisations 2 D Cartesian plot (Scatter Plot) Histograms Some 3 D plots using colour/graphics BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 1
Parallel Coordinates M. d'Ocagne in 1885 Reinvented by A. Inselberg (Tel-Aviv University) in 1985 Point ↔ line duality ` BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 2
Parallel Coordinates Align axes parallel to each other → a planar diagram Not constrained to 2 or 3 dimensions each ndimensional point has a unique representation Eg one data instance in this representation: BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 3
Parallel Coordinates E. Wegman (George Mason University) Work on both parallel coordinates and grand tours (inc. axes comaprisons) Crystal. Vision http: //www. newton. ac. uk/programmes/SCH/seminars/010714001. html Parallel Coordinate Density Plots use transparency to tackle the problem of over-plotting Linked brushing/deletion with other plots eg Cartesian to get the best of both worlds. BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 4
Parallel Coordinates Uses and advantages: Search for patterns in the data Try to separate out signal from background Give information relating to which data mining algorithms would work well on the dataset (NN, SVM etc) Can explore data which is both high-dimensional and massive in size Get an overview of the data and also focus on details on the same plot BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 5
Grand Tour D. Asimov (SLAC 1983) “a method for viewing multivariate statistical data via orthogonal projections onto a sequence of two-dimensional subspaces. The sequence of subspaces is chosen so that it is dense in the set of all two-dimensional subspaces” View 2 D projections (scatterplots) of n-dimensional data (combination of axes) Rotation of coordinate axis n-dimensional space Animation: determined by a space-filling curve through all posible orientations of 2 D projection in n. D space BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 6
Parallel Coordinates/Grand Tour Couple the two together → the problem of axis ordering disappears Manifold of 2 -planes becomes manifold of kplanes in p-dimensional space (k≤p) Can do partial tours by deselecting axes As shown in Crystal. Vision: ftp: //www. galaxy. gmu. edu/pub/software/Crystal. Vision. exe Hopefully in my program soon (more details later) BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 7
My 2009 Went to RAL: joined Particle Physics Department and e-Science Vis. Cluster: 17 nodes each with 2 2 GHz CPU's, 8 GB RAM, Nvidia FX 4500, 2 250 GB hdd Went on a few C++ and python courses Went to 2 schools. . . BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 8
CSC 2009 BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 9
CSC 2009 Held in Göttingen, Germany 2 weeks in August Covered the following topics: Software engineering and tools (cppunit, cvs etc) Cryptography (PKI, Kerberos etc) Security Computer Architecture & Performance tuning (SIMD/SSE, parallelism, tools etc) Networking Qo. S & Performance Virtualisation Physics computing ROOT technologies (CINT, ACLi. C, PROOF etc) Data analysis (multivariate analysis, statistics etc) BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 16 10
RAL HEP Summer School Held in Oxford, UK 2 weeks in September Covered the following topics: QFT QED/QCD SM Phenomenology BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 11
My Program Released the prototype in November Works on the Vis. Cluster Works on Linux machines with decent hardware and OS BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 12
My Program BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 13
Presentations In 2009: Also gave presentations to: the Viz. NET conference my e-Science group, and my Particle group. BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 14
In the Year 2010 Finish the program – adding the grand tour, Qt dynamic functions, talk to ntuples directly(? ), improve data structure Give lectures to i. CSC 2010 More presentations Perhaps(!) work with Ne. XT institute, Manchester(? ), TMVA, ROOT Particle Physics using the program. Work with data from an experiment: general purpose experiment at the LHC. . . BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 15
CMS CMS Exotics group at RAL Minimal B-L extension to the SM arxiv: 0909. 3113 v 1 SU(3)C x SU(2)L x U(1)Y x U(1)B - L Z' with a mass, for example, of >1 Te. V Long lived heavy neutrinos BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 16
Thank You Further information through my website Open research project: www. hep. manchester. ac. uk/u/benjamin Questions? BENJAMIN RADBURN-SMITH DATA VISUALISATION MANCHESTER 05/01/10 17
- Slides: 19