Presentation is loading. Please wait.

Presentation is loading. Please wait.

Cooperative Association for Internet Data Analysis CAIDA Activities TERENA – May 22, 2007 Colleen Shannon

Similar presentations


Presentation on theme: "Cooperative Association for Internet Data Analysis CAIDA Activities TERENA – May 22, 2007 Colleen Shannon"— Presentation transcript:

1 Cooperative Association for Internet Data Analysis CAIDA Activities TERENA – May 22, 2007 Colleen Shannon cshannon@caida.org

2 Cooperative Association for Internet Data Analysis What is CAIDA? Cooperative Association for Internet Data Analysis http://www.caida.org/ Goals include measuring and understanding the global Internet. Develop measurement and analysis tools Collect and provide Internet data: topology, header traces, routing, network security, DNS Visualization of the network

3 Cooperative Association for Internet Data Analysis Outline Macroscopic Topology Measurement Routing DNS Security COMMONS Data Collection, Curation, and Distribution –DatCat: the Internet Measurement Data Catalog Tools

4 Cooperative Association for Internet Data Analysis Macroscopic Topology Measurement (Brad Huffaker, Young Hyun) Skitter project continues: daily traceroute- like measurements to ~500,000 locations New active measurement infrastructure: Archipelago (Ark) –Architecture supports: Coordinated measurements (e.g. team probing) Shared use of a common measurement infrastructure Security measures to ensure responsible use and data integrity

5 Cooperative Association for Internet Data Analysis Routing (Dima Krioukov) Realistic topology generation –dK series graphs can generate topologies that satisfy a series of graph properties AS Adjacencies –Traceroute-based matrix of Internet AS-level graph AS Relationships, Classification, and Taxonomy –AS adjacencies annotated with information such as inferred customers and providers and IP address space

6 Cooperative Association for Internet Data Analysis Domain Name System (DNS) (Duane Wessels, Marina Fomenkov) DNS Surveys –Open resolvers (recursive name resolution to folks outside their administrative domain) –Cache poisoning (incorrect referrals for important domains) –Nameserver software prevalence RTT measurements to DNS root and gTLD servers

7 Cooperative Association for Internet Data Analysis Current Security Research (David Moore, Colleen Shannon) Nyxem/Blackworm/KamaSutra/MyWife –http://www.caida.org/analysis/security/blackworm/ Spamscatter Botnet Economics Worm Risk Analysis Anomaly Detection

8 Cooperative Association for Internet Data Analysis Internet ID Consumption IPv4 address space

9 Cooperative Association for Internet Data Analysis COMMONS (k claffy) Cooperative Measurement and Modeling of Open Networked Systems Problems: –Infrastructure financial crisis –Data acquisition crisis –Struggle for survival for emerging community/municipal wireless network Solution: Cooperative national backbone connecting community and municipal networks –Low-cost access for community wireless networks via shared network resources –Implicit support (and consent) for measurement activities

10 Cooperative Association for Internet Data Analysis CAIDA Datasets Freely available datasets Academic / Non-profit access datasets For-profit use: sponsor dataset creation –Join CAIDA: http://www.caida.org/home/legal/sponsorinfo.xml http://www.caida.org/home/legal/sponsorinfo.xml –US organizations: use PREDICT http://www.predict.org/

11 Cooperative Association for Internet Data Analysis Day in the Life of the Internet At-least annual measurement with as many networks participating as possible Most recent: January 9-10, 2007 –7 DNS participants (C root, F root, K root, M root, AS112, B ORSN, M ORSN) –5 network participants (WIDE, KAIST, POSTTECH, AMPATH, CAIDA) To join future DITL data collections, email ditl-info@caida.org

12 Cooperative Association for Internet Data Analysis Freely Available Data The following datasets are available to anyone who wishes to use them: –AS Adjacencies –Router Adjacencies –Code-Red Worm –Witty Worm –AS Relationships –AS Rank –AS Taxonomy

13 Cooperative Association for Internet Data Analysis Data available for non-profit use The following datasets are available to academic, government, and non-profit researchers: –Raw macroscopic topology traces (skitter) –OC48 peering point data –Denial-of-service attack backscatter (TOCS, 2004- 2005, 2006) –Witty Worm –DNS root/gTLD RTT data

14 Cooperative Association for Internet Data Analysis Internet Measurement Data Catalog http://imdc.datcat.org

15 Cooperative Association for Internet Data Analysis DatCat Goals (1) to facilitate searching for and sharing of data among researchers –Index as much as possible, including datasets not publicly available –DatCat doesn’t store any network data itself

16 Cooperative Association for Internet Data Analysis DatCat Goals (2) to enhance documentation of datasets via a public annotation system –Easy place for anyone (not just the dataset creator) to provide additional information –Persistent reference that stays with the dataset (not a footnote in a paper)

17 Cooperative Association for Internet Data Analysis DatCat Goals (3) to advance network science by promoting reproducible research –Test new technologies on consistent datasets to compare apples with apples

18 Cooperative Association for Internet Data Analysis DatCat lets you… Find data for research/engineering Annotate datasets to note features, background information, or bugs Cite data Contribute data (coming soon!)

19 Cooperative Association for Internet Data Analysis DatCat Status DatCat available for public viewing since June 12, 2006 Contribution interface open to beta-testers 76,708 data items 6 TB of data 33 Collections and Publications –15 non-CAIDA Data Collections (26 total) –6 non-CAIDA Publications (7 total)

20 Cooperative Association for Internet Data Analysis DatCat Example

21 Cooperative Association for Internet Data Analysis DatCat Example

22 Cooperative Association for Internet Data Analysis Collaboration Current: –CRAWDAD: Community Resource for Archiving Wireless Data at Dartmouth –MOME/MOMENT –UCSD-CSE, ICSI Future: –Abilene Observatory –RouteViews

23 Cooperative Association for Internet Data Analysis Next Steps Currently testing programmatic contribution interface Add support for Papers (specialized collection) Add support for tools GUI contribution interface

24 Cooperative Association for Internet Data Analysis For more information DatCat: http://imdc.datcat.org/ General questions and comments –info@datcat.orginfo@datcat.org Announcements –user-announce@datcat.orguser-announce@datcat.org Contribution beta-test –contribute@datcat.org

25 Cooperative Association for Internet Data Analysis PREDICT Overview Protected REpository for the Defense of Infrastructure against Cyber Threats –Problems PREDICT solves –Challenges thus far http://www.predict.org

26 Cooperative Association for Internet Data Analysis Why PREDICT? Most researchers do not have access to the data needed to research solutions to current security problems on the Internet Getting data requires cultivating personal relationships/trust over years (out of scope for academia) Significant security and privacy problems with distributing data Collecting, curating, and distributing data is expensive Getting data doesn’t scale for researchers Giving data doesn’t scale for providers

27 Cooperative Association for Internet Data Analysis PREDICT Goals Collect high-quality, relevant data Provide a minimally-secured index of available data Provide a robust legal and procedural framework to ensure the legality of distribution and appropriate handling of data Note: near lack of technology involved…

28 Cooperative Association for Internet Data Analysis PREDICT Challenges Getting commercial providers to sign Memos of Agreement is near impossible because it requires official acknowledgement that data is collected. –Also, it involves smart lawyers whose job is to minimize corporate risk. Correctly handling privacy is challenging – there is a clear research need for non-anonymized data. Distribution of non-anonymized data is inherently orthogonal to preserving privacy. –Finding middle ground takes time. Minimize government access to data (FOIA, bad press, big brother) –No one wants the government to have the data. Not even the government. Is it legal to collect data from a network that you do not own? –Few (if any) case histories to work from

29 Cooperative Association for Internet Data Analysis Progress! First step will only include non-anonymized data from non-commercial providers Anonymization helps with privacy; up-front meetings with privacy advocates very helpful (“model government program”) Procedures and review structure to minimize government involvement while protecting government interests Extensive legal research/documentation to support legality of collecting and distributing network data

30 Cooperative Association for Internet Data Analysis CAIDA Tools Measurement and analysis –CoralReef –Scamper –NeTraMet –DSC Visualization –Walrus –Cuttlefish –Otter

31 Cooperative Association for Internet Data Analysis Otter Example: AS Connectivity Map

32 Cooperative Association for Internet Data Analysis Walrus Example: Code-Red Worm

33 Cooperative Association for Internet Data Analysis Cuttlefish Example: Blackworm Virus (live demo)

34 Cooperative Association for Internet Data Analysis For more information… CAIDA Research: –http://www.caida.org/research/http://www.caida.org/research/ CAIDA Data: –http://www.caida.org/data/http://www.caida.org/data/ DatCat: –http://imdc.datcat.orghttp://imdc.datcat.org CAIDA Tools –http://www.caida.org/tools/

35 Cooperative Association for Internet Data Analysis Contact Information Questions about this talk: –cshannon at caida.org Questions about CAIDA in general –Info at caida.org Questions about CAIDA data –Data-info at caida.org Questions about the Day in the Life of the Internet (DITL) project –Ditl-info at caida.org Questions about DatCat –Info at datcat.org –Contribute at datcat.org


Download ppt "Cooperative Association for Internet Data Analysis CAIDA Activities TERENA – May 22, 2007 Colleen Shannon"

Similar presentations


Ads by Google