CollSpotting: Big, Beautiful Data Andrew Grant STFC Jean-Marie le Goff CERN Andrew Grant STFC Jean-Marie le Goff CERN.

Slides:



Advertisements
Similar presentations
Technology Roadmap Project Harold Flescher VP-Elect, Technical Activities August 2008, Region 1 Meeting.
Advertisements

BBSRC and data visualisation Head of Policy Evidence
Project number Research Title Your name Your supervisor(s) Industry Champion (if appropriate) University Start enrolment:month/year Expected completion:month/year.
The Small World of Software Reverse Engineering Ahmed E. Hassan and Richard C. Holt SoftWare Architecture Group (SWAG) University Of Waterloo.
Principal Patent Analyst
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Shou Ray Information Service Co., Ltd.
Academic and Corporate Relations Center. The President welcomes You.
Class 11 Decision Making, Decision Support Systems, & Executive Information Systems MIS 2000Decision Making and Information Systems.
Robert Huggins and Daniel Prokop Centre for International Competitiveness, Cardiff School of Management, University of Wales Institute, Cardiff Presentation.
Collaboration Spotting for Technology Transfer. Technology Transfer  “ active and intentional process to disseminate or acquire knowledge, experience.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Engineering & Physical Sciences Research Council.
Business Plan What is a Business Plan? Defn: “written document containing the guidelines for the business center’s (product/ group of products/
Strengthening the quality of research for policy engagement in the African context – achievements and aspirations Tebogo B. Seleka Botswana Institute for.
Welcome to the December 2014 meeting. Ethos and Outcomes Learning Technologists Leeds - Ethos and Outcomes Collaborative Share attendance and.
1 HiPEAC Workshop on Building Partnerships September 25, 2014 Ljubljana, Slovenia.
A Transnational TTO for ELI Collaboration with CERN J.-M. Le Goff ELI-CTU-CERN meeting ELI, Prague, January 29th-30th, 2014 HEPTech-CERN Collaboration.
Overview Prototyping and construction Conceptual design
Author(s) (Name of student) and their Affiliation (Department/Course/Club, School Name and Address) FUTURE DIRECTIONS RESULTS: ANALYSIS AND IMPLICATIONS.
Principles of Social Network Analysis. Definition of Social Networks “A social network is a set of actors that may have relationships with one another”
Aggregate patterns of linkage of nanotechnology engineering centers with industry Luciano Kay School of Public Policy, Georgia Institute of Technology.
CURRIKI --An Overview Presented to the Bioscience Interest Group Christine Loew Program Manager
The World Wide Web is a great place to find more information about a topic. But there are a lot of sites out there—some are good and some are not so good.
1 Direction scientifique Networks of Excellence objectives  Reinforce or strengthen scientific and technological excellence on a given research topic.
THE IMPORTANCE OF IPR ACROSS THE LIFECYCLE OF INNOVATION Bob Stembridge Principal Patent Analyst, IP & Science.
Sketches and prototypes for the Orlando Six Degrees of Separation Project.
Tracking national portfolios and assessing results Sub-regional Workshop for GEF Focal Points in West and Central Africa June 2008, Douala, Cameroon.
5 July 2012Ganesha Associates1 Basic Skills for Scientific Research and Publishing. Segment 1. Introduction to the course.
+ Big Data, Network Analysis Week How is date being used Predict Presidential Election - Nate Silver –
GET CONNECTED Information Technology Career Cluster.
What’s New in FlyBase EDRC 2015, Heidelberg. Visualising interaction networks.
Date: 2012/08/21 Source: Zhong Zeng, Zhifeng Bao, Tok Wang Ling, Mong Li Lee (KEYS’12) Speaker: Er-Gang Liu Advisor: Dr. Jia-ling Koh 1.
Developing New Journals Alison Mercer Kathryn Wilson.
CEC and its WWW Challenges for the New Year Results Web Survey December 2009 among CEC members Frits Hesselink, Andy Alm 31 December 2009.
Preliminary Survey on forming an Africa-Lics network Erika Kraemer-Mbula & Watu Wamae All African Globelics Seminar on Innovation and Economic Development.
ARCHIVES AND RECORDS MANAGEMENT PROFESSIONAL ASSOCIATION AND JOURNAL ANALYSIS Kim Edwards MARA September 2015.
HCC class lecture 21: Intro to Social Networks John Canny 4/11/05.
Cern.ch/knowledgetransfer. Knowledge Transfer | Accelerating Innovation BE-KT Innovation Day,
Big Data Using Big Data for Cultures and Communities Jeremy Reffin Simon Wibberley CASM, University of Sussex Carl Miller CASM, Demos July 2014.
NETWORKING APAMSA Leadership Development Module. Networking  Networks will involve several people both inside and outside the organization  Ultimate.
WP4 STATUS AND OUTLOOK Hartmut Hillemanns TTN Meeting, 10./11. December 2009.
Scotland’s Colleges is a trading name of both the Scottish Further Education Unit and the Association of Scotland’s Colleges Curriculum for Excellence.
Building Systems for Today’s Dynamic Networked Environments A Methodology for Building Sustainable Enterprises in Dynamic Environments through knowledge.
This project has received funding from the European Union’s Seventh Framework Programme for research, technological development and demonstration under.
Building and Connecting Emerge Jim Hensman – Coventry University George Roberts – Oxford Brookes University Next Generation Technologies in Practice Conference.
White Paper Preliminary Feedback 2 Way Relationship with Business Community.
MEASURING RESEARCHERS: FROM RESEARCH PUBLICATIONS AND COMMUNICATION TO RESEARCH EVALUATION Lucie Vavříková 1.
STFC’s National Laboratories Round Table on Synergies and Complementarity among Laboratories John Womersley Chief Executive, STFC 13 th Pisa meeting on.
1 © 2006 Nokia Innovation and Competitiveness ICT Industry Perspective Lauri Kivinen Vice President, Head of Nokia EU Representative Office, Brussels Budapest,
BI06 THE TIME IS NOW TO GET STARTED WITH MICROSOFT POWER BI James Crowter MVP, Managing Director, Technology Management Sorry downloaders but you’ll have.
WP1 WP2 WP3 WP4 WP5 COORDINATOR WORK PACKAGE LDR RESEARCHER ACEOLE MID TERM REVIEW CERN 3 RD AUGUST 2010 Work Package 1: Pixel detector systems for particle.
D Programme Level Cooperation analysis and evaluation report (DLR) Outline & main findings.
Paulo Cardoso, Internship progress report Internship: Technology Transfer Facilitator Trainee: Paulo Jorge Magalhães Cardoso Supervisor: Jean-Marie.
WEB STRUCTURE MINING SUBMITTED BY: BLESSY JOHN R7A ROLL NO:18.
Presented by Alex Mitchell Joy Mkhasibe
INDUSTRY ALIGNMENT FUND – PRE-POSITIONING PROGRAMME LETTER OF INTENT
INDUSTRY ALIGNMENT FUND – PRE-POSITIONING PROGRAMME LETTER OF INTENT
Institute of Economics, University of Campinas, Brazil.
CSE5544 Final Project Interactive Visualization Tool(s) for IEEE Vis Publication Exploration and Analysis Team Name: Publication Miner Team Members:
INDUSTRY ALIGNMENT FUND – PRE-POSITIONING PROGRAMME LETTER OF INTENT
CSE5544 Final Project Interactive Visualization Tool(s) for IEEE Vis Publication Exploration and Analysis Team Name: Publication Miner Team Members:
Collaboration Spotting: Visualisation of LHCb process data
INDUSTRY ALIGNMENT FUND – PRE-POSITIONING PROGRAMME LETTER OF INTENT
What is €5 billion worth? Magda Gunn, IMI Scientific Project Manager.
E-Commerce Theories & Practices
Current Issues or Challenges in Visual Analytics
2010年度部门内部品牌推广计划 部门: 调研及策略部.
An Efficient method to recommend research papers and highly influential authors. VIRAJITHA KARNATAPU.
The employability network stars
Presentation transcript:

CollSpotting: Big, Beautiful Data Andrew Grant STFC Jean-Marie le Goff CERN Andrew Grant STFC Jean-Marie le Goff CERN

Intro to CollSpotting How does it work? What problem does it solve? Model What’s next?

Developed at CERN by Physicists We developed the program to help us figure out who the key players at the cutting edge of the 100s of research fields CERN is active in are. Realised this could be much more widely applicable – which is where you can help! An FP7 project that addresses infrastructures required for detector development for future particle physics experiments

What is CollSpotting? Software developed at CERN Identifies relationships between institutions and visualises them Visualise clusters, who works with whom and who is active in your field of interest Find closely related topics and hidden connections Powerful data-mining and visualisation algorithms can be expanded to new areas

CollSpotting sifts 720m+ Publications: “Who works with Whom?” In principle, can include any kind of databases where “authorship” can be attributed to different organisations/entities – what else would you like to see here?

How Collaboration Spotting Works Data-mining from patent, publication etc. databases (see last slide) Whose names appear together a lot? Which keywords appear in the same kinds of clusters?

Using Social Network Analysis and Graph Theory to Visualise Complex Relationships Easily Pretty, huh? Assign a value to how correlated each two data points (nodes) are, e.g. “how many papers have these two institutes jointly published?” In a network graph, data points with a large degree of correlation end up clustering together. Additionally: thicker connections (edges) = stronger correlation, larger dots = more prominent data points. Can spot key players and relationships at a glance, detect underlying patterns.

Germanium Interactive: Click on a Node to Highlight its Links Germanium Detectors (key players)

What problems can you solve with it? Identify potential collaborators and competitors. Identify important economic and research clusters Who’s patenting in this space? Where is there still room for me to operate? Assess the strength of your technologies Look for me-too technologies Spot technology trends using timeline What else?

How do people currently spot these connections and trends? Specialist search engines for patents (Thomson Reuters), publications (ISI WoK), unstructured data (Autonomy) Attend conferences and workshops Consultancies to do the leg-work for you There’s currently no easy way to do this!

Some examples Researchers: find relevant collaborators Industry: target less-contested areas for R&D Lawyers: Patent landscapes Investors: Spot opportunities and buyers Basically anyone who wants a rapid, easily digestible summary of who is who in an area of interest and all the hidden links between them.

Micro Pattern Gaseous detectors: 396 publications Weizmann Institute

Micro Pattern Gaseous detectors: 111 patents

Micro Pattern Gaseous detectors: 396 publications (Weizmann)

Micro Pattern Gaseous detectors: All publications; Key players (Weizmann in RD-51)  GEM = Collaboration with IN2P3, CERN;  Micromegas = collaboration with CEA

Micro Pattern Gaseous detectors: All publications; centrality (Weizmann)

Ge detectors 2497 publications  Weizmann

Medipix2 + Timepix (244 pubs) Partner with NIKHEF, a member of the Medipix (2 & 3) collaborations Ge detectors Weizmann’s patent

Conclusion The current incarnation of the software could be used to solve some big problems related to the big data challenge Possibility to extend the software’s scope to be useful in new settings And remember, just use it and give feedback in our blog!