Australian Research Council Support ● 3-year (2004-2006) ARC Discovery Project Grant “New Methods for Researching the Existence and Impact of Political.

Slides:



Advertisements
Similar presentations
Incorporating Site-Level Knowledge to Extract Structured Data from Web Forums Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei Zhang, and Wei-Ying Ma.
Advertisements

The recent technological advances in mobile communication, computing and geo-positioning technologies have made real-time transit vehicle information systems.
Node-Attribute Graph Layout for Small-World Networks Helen Gibson Principal Supervisor: Dr. Paul Vickers 1 st Supervisor: Dr. Maia Angelova 2 nd Supervisor:
The North American Carbon Program Google Earth Collection Peter C. Griffith, NACP Coordinator; Lisa E. Wilcox; Amy L. Morrell, NACP Web Group Organization:
The Last Procedure Before First Functional Prototype Grant Boomer, Brett Papineau, Tanis Lopez, Archana Shrestha CS 383.
1 Generic logging layer for the distributed computing by Gene Van Buren Valeri Fine Jerome Lauret.
Breadth-First Search Seminar – Networking Algorithms CS and EE Dept. Lulea University of Technology 27 Jan Mohammad Reza Akhavan.
Design and Development of Duct-Diffuser Augmented Propeller Low Head Hydro Turbines Faculty of Engineering and the Environment Tauseef Ahmed –
A Presentation Management System for Collaborative Meetings Krzysztof Wrona (ZEUS) DESY Hamburg 24 March, 2003 ZEUS Electronic Meeting Management System.
The Ethics of Large-Scale Web Data Analysis (Webmetrics) Mike Thelwall, Statistical Cybermetrics Research Group, University of Wolverhampton, UK Rob Ackland,
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
Topology Generation Suat Mercan. 2 Outline Motivation Topology Characterization Levels of Topology Modeling Techniques Types of Topology Generators.
Final Presentation WINTER 2009 – SUMMER 2009 PRESENTED BY: George Kour Hany Danial SUPERVISOR: Victor Kulikov Networked Software Systems Laboratory DEPARTMENT.
Stephen Ward & Rachel Gibson Oxford Internet Institute, University of Oxford ACSPRI Centre, Australian National University Parties and the Virtual Campaign:The.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. 1 The Architecture of a Large-Scale Web Search and Query Engine.
Force Directed Algorithm Adel Alshayji 4/28/2005.
2009 Ensemble Semantic Technologies for the Enhancement of Case Based Learning Patrick Carmichael, Project.
Force Directed Algorithm Adel Alshayji 4/28/2005.
Establishing Pairwise Keys in Distributed Sensor Networks Donggang Liu, Peng Ning Jason Buckingham CSCI 7143: Secure Sensor Networks October 12, 2004.
Analysing the link structures of the Web sites of national university systems Mike Thelwall Statistical Cybermetrics Research Group University of Wolverhampton,
Robert Huggins and Daniel Prokop Centre for International Competitiveness, Cardiff School of Management, University of Wales Institute, Cardiff Presentation.
What is R Muhammad Omer. What is R  R is the programing language software for statistical computing and data analysis  The R language is extensively.
31 January 2007Craig E. Ward1 Large-Scale Simulation Experimentation and Analysis Database Programming Using Java.
Analysis and Modeling of the Open Source Software Community Yongqin Gao, Greg Madey Computer Science & Engineering University of Notre Dame Vincent Freeh.
By LaBRI – INRIA Information Visualization Team. Tulip 2010 – version Tulip is an information visualization framework dedicated to the analysis.
Centre for Earth Systems Engineering Research Infrastructure Transitions Research Consortium (ITRC) David Alderson & Stuart Barr What is the aim of ITRC?
The Internet in Education Objectives Introduction Overview –The World Wide Web –Web Page v. Web Site v. Portal Unique and Compelling Characteristics Navigation.
Strategies for improving Web site performance Google Webmaster Tools + Google Analytics Marshall Breeding Director for Innovative Technologies and Research.
Programming the Web Web = Computer Network + Hypertext.
Learning Through Social Connection Hannah Beaman Online Communities and Web Development Manager SocialLearn.
Fundamentals of Database Chapter 7 Database Technologies.
ANDS Back to Basics Workshop Research Data Management and the ECU Library Poh Lin Teow & Gordon McIntyre Librarians: Research Services Bentley Technology.
Using Hyperlink structure information for web search.
Web Categorization Crawler Mohammed Agabaria Adam Shobash Supervisor: Victor Kulikov Winter 2009/10 Design & Architecture Dec
Automatically Extracting Data Records from Web Pages Presenter: Dheerendranath Mundluru
HTML. Principle of Programming  Interface with PC 2 English Japanese Chinese Machine Code Compiler / Interpreter C++ Perl Assembler Machine Code.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
LOGO 2 nd Project Design for Library Programs Supervised By Dr: Mohammed Mikii.
Group ID: Prepared By: Jubin Goswami Milan Valambhiya.
1 Smashing Peacocks Further: Drawing Quasi-Trees from Biconnected Components Daniel Archambault and Tamara Munzner, University of British Columbia David.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.
OpenWeb: Expanding access to Digital Collections Marshall Breeding Director for Innovative Technologies and Research Vanderbilt University
Crystal25 Hunter Valley, Australia, 11 April 2007 Crystal25 Hunter Valley, Australia, 11 April 2007 JAINIS (JCU and Indiana Instrument Services): A Grid.
Under the hood source files in java J2SE, Tomcat, Pellet, MySQL, Spring 851 MB code base.
 Copyright 2011 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Enabling Networked Knowledge.
PHP and mySQL 2/9/2007. What is PHP?  From php.net “PHP is a widely-used general- purpose scripting language that is especially suited for Web development.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
For: CS590 Intelligent Systems Related Subject Areas: Artificial Intelligence, Graphs, Epistemology, Knowledge Management and Information Filtering Application.
SOFTWARE DESIGN. INTRODUCTION There are 3 distinct types of activities in design 1.External design 2.Architectural design 3.Detailed design Architectural.
Web Intelligence Complex Networks I This is a lecture for week 6 of `Web Intelligence Example networks in this lecture come from a fabulous site of Mark.
Mining real world data Web data. World Wide Web Hypertext documents –Text –Links Web –billions of documents –authored by millions of diverse people –edited.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
The Structure of the Web. Getting to knowing the Web How big is the web and how do you measure it? How many people use the web? How many use search engines?
IHacky Jon Lao Hong Nguyen Marcius Bagwan. iHacky Goals: Widen the social level of the developer community by popularizing their ways of software development.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 14 Database Connectivity and Web Technologies.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
“Niche Work” Graham J Wills, Lucent Technologies (Bell Lab)
Understanding Web-Based Digital Media Production Methods, Software, and Hardware Objective
GROUP PresentsPresents. WEB CRAWLER A visualization of links in the World Wide Web Software Engineering C Semester Two Massey University - Palmerston.
Web mining is the use of data mining techniques to automatically discover and extract information from Web documents/services
Developer Exam Preparation Thom Robbins Bryan Soltis
C. Bruce Entwistle Science and Operations Officer Aviation Weather Center Kansas City, MO C. Bruce Entwistle Science and Operations.
The New NAP Members’ Area Development. Elgg What is elgg? –Elgg is an award-winning open source social networking platform.
Information Systems Design and Development
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
Spanning Tree Algorithms
Trees Kun-Mao Chao (趙坤茂)
Graphs G = (V,E) V is the vertex set.
Presentation transcript:

Australian Research Council Support ● 3-year ( ) ARC Discovery Project Grant “New Methods for Researching the Existence and Impact of Political Networks on the WWW – Robert Ackland and Rachel Gibson (ANU) ● ARC Special Research Initiative (e-Research Support) Grant to establish VOSON and conduct demonstrator project – Robert Ackland, Rachel Gibson, Mathieu O'Neil (ANU), Bruce Bimber (UCSB), Stephen Ward (OII, Oxford)

Software - current status ● VOSON is "powered" by uberlink - web-based research software that facilitates the collection and analysis of online network data. uberlink is built using open-source software components and features: – PhP and javascript web interface – MySQL database – Perl-based web crawler and – Interface to the Google API – Data manipulation and analysis routines Perl and C++.

Analysis of data from WWW ● Data preparation – Web data are inherently noisy - the inclusion of irrelevant pages into the study ("topic drift") needs to be minimised. – Choice of unit of analysis - while data are collected at the page-level, analysis is generally conducted over aggregations of pages - need methods for meaningfully aggregating pages. – Machine learning methods will be useful.

Analysis of data from WWW ● Data visualisation – Visualisation of networks is important for their study. – Key is to find visualisation software that can work with large network graphs within web-based application. – Directed minimum spanning tree showing all nodes connected to a particular root node is displayed using LGL layout algorithm.

Outbound links from - LGL layout

Outbound links from - HypViewer

Analysis of data from WWW ● Data visualisation (cont.) – The LGLViewer provides an abstraction of a network since it only shows the shortest path between the root node and all other connected nodes in the database. – To visualise all nodes and all links simultaneously a force-directed graphing (FDG) algorithm is used. – Web sites are given initial random positions and modelled as electrostatic charges (global repulsion forces). Hyperlinks between web sites are modelled as springs (attraction forces) that move nodes to minimise the energy of the system thus revealing web clusters.

Analysis of data from WWW ● Data analysis – Crosstabulations (composition of dataset and links to/from seed sites) is provided in uberlink – Network datasets can be downloaded and analysed further using social network analysis software – Plans to access R sna package routines from within uberlink

Research projects ● New forms of Collection Action on the WWW ● Online networking behaviour of political parties ● Structural properties of far-right networks ● Information on the WWW for potential migrants ● The abortion debate on the WWW ● Sampling of web data ● Dynamics of conflict in online communities: a field theoretical approach ● Modelling the link economy

e-Research plans ● While uberlink is currently generating data and analysis for research, there are clear technological constraints relating to data management, computation and resource sharing that prevent large-scale collaborative research. We aim to overcome these constraints via the use of e-research technologies, possibly by exposing key features of uberlink (computational and webmining code, visualisation engines, databases) as Grid or web services.