Eric C. Glass & Dana Neacsu


Data Mining (scraping) & Interdisciplinary Research in Law: Gov Docs: Domestic and Foreign
Eric C. Glass & Dana Neacsu

Web Scraping
Extracting and parsing formatted data from a web page (HTML, XHTML, JSON, etc.). Can be automated or manual.
Python Beautiful Soup - https://www.crummy.com/software/BeautifulSoup/
  Toolkit for dissecting a document and extracting what you need
  It doesn't take much code to write an application
  Manages encodings
  Sits on top of popular Python parsers like lxml and html5lib
  Gathering election results example: http://www.b-list.org/weblog/2010/nov/02/news-done-broke/
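To make "extracting and parsing formatted data from a web page" concrete, here is a minimal sketch in Python. It uses only the standard library's html.parser as a stand-in for Beautiful Soup, so it runs without installing anything; Beautiful Soup would do the same job in fewer lines. The HTML snippet, the candidate names and the vote counts are made up for illustration and are not taken from the election example linked above.

```python
from html.parser import HTMLParser

# Hypothetical page fragment; a real scraper would download this HTML first.
SAMPLE_HTML = """
<table id="results">
  <tr><th>Candidate</th><th>Votes</th></tr>
  <tr><td>Candidate A</td><td>1,204</td></tr>
  <tr><td>Candidate B</td><td>987</td></tr>
</table>
"""


class TableScraper(HTMLParser):
    """Collects the text of every table cell, grouped by row."""

    def __init__(self):
        super().__init__()
        self.rows = []          # finished rows, each a list of cell strings
        self._row = None        # row currently being read, or None
        self._in_cell = False   # True while inside a <td> or <th>

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._in_cell = True
            self._row.append("")

    def handle_data(self, data):
        # Text outside a cell (whitespace between tags) is ignored.
        if self._in_cell:
            self._row[-1] += data

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._in_cell = False
        elif tag == "tr" and self._row is not None:
            self.rows.append([cell.strip() for cell in self._row])
            self._row = None


def scrape_table(html):
    """Return the table as a list of dicts keyed by the header row."""
    parser = TableScraper()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


rows = scrape_table(SAMPLE_HTML)
# Turn "1,204" into the integer 1204 so the counts can be summed or sorted.
totals = {r["Candidate"]: int(r["Votes"].replace(",", "")) for r in rows}
print(totals)
```

The same pattern (find the repeating element, pull out its text, convert types) applies whichever parser is used.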

Web Scraping
Automated tools (no programming)
Import.io
  Cloud-based web application
  https://www.import.io/
  Apparently no longer free, but a free trial is available
Webscraper.io
  Chrome browser plug-in, available free
  http://webscraper.io/
  Sitemap building, data extraction and export are all done within the browser
  We have not used it, but there is a YouTube video: https://www.youtube.com/watch?v=y00t5NpW7pY

Web Scraping
Premade tools (many applications on GitHub)
Example: NYPD Crash Data Band Aid
  http://blog.johnkrauss.com/nypd-crash-data-band-aid/
  On GitHub - https://github.com/talos/nypd-crash-data-bandaid
  NYPD released the data through “idiotically obfuscated PDFs”
  Tool is built in Python, on top of xpdf and wget
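As a rough illustration of the step that follows PDF extraction, the sketch below turns column-aligned text (the kind of output a tool such as xpdf's pdftotext produces in layout mode) into records. It is not the Band Aid tool's actual code. The sample text, column names and numbers are invented, and the rule "two or more spaces separate columns" is an assumption that real PDFs often break.

```python
import re

# Made-up sample of column-aligned text, standing in for extracted PDF text.
SAMPLE_TEXT = """\
Intersection                     Collisions   Injured
BROADWAY and W 116 ST                     3         1
AMSTERDAM AVE and W 120 ST                1         0
"""

# Assumption: columns are separated by runs of two or more spaces.
COLUMN_GAP = re.compile(r"\s{2,}")


def parse_layout_text(text):
    """Split aligned text into a list of dicts keyed by the header line."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if not lines:
        return []
    header = COLUMN_GAP.split(lines[0])
    records = []
    for line in lines[1:]:
        fields = COLUMN_GAP.split(line)
        # Skip lines that do not match the header (page footers, notes, etc.).
        if len(fields) != len(header):
            continue
        records.append(dict(zip(header, fields)))
    return records


records = parse_layout_text(SAMPLE_TEXT)
for record in records:
    print(record)
```

Rows that fail the column-count check are dropped silently here; in a real project they should be logged and inspected, since they are usually where the data is lost.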

Training and Help
Lynda.com (through the Libraries' license)
  Python: Programming Efficiently
Codecademy
  Python intro, in addition to a variety of web-based APIs
Digital centers in the libraries
  Python open lab
  R open lab
Collaboratory at Columbia University
  An appointment-based free consulting service for students and researchers at Columbia University that offers assistance with planning and executing data-driven research projects, including help with data visualization, analysis and prediction, both in conceptual terms and with concrete software implementations.
  https://www.surveymonkey.com/r/CollaboratoryClinic

Read your question for research clues: Q. 1. What is the connection between gun ownership and public health?

Open Data
Map of open data policies: http://www.opendatapolicies.org/browse/
NYC Open Data
  On March 7, 2012, former Mayor Bloomberg signed Local Law 11 of 2012, more commonly known as the “Open Data Law,” which amended the New York City administrative code to mandate that all public data be made available on a single web portal by the end of 2018.
  https://opendata.cityofnewyork.us/
NYS Open Data
FOIA & FOIL requests
  https://www.dos.ny.gov/coog/freedomfaq.html#denybroad
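When a portal publishes data through an API, no scraping is needed: a query URL is enough. The sketch below only builds such a URL and does not download anything. It assumes the portal exposes Socrata-style endpoints of the form /resource/<dataset id>.json with $limit and $where parameters; the base URL, the dataset id "abcd-1234" and the column name "borough" are placeholders, so check the portal's own API documentation before relying on any of them.

```python
from urllib.parse import urlencode

# Assumed API host for the city portal; verify against the portal's documentation.
BASE_URL = "https://data.cityofnewyork.us"


def build_query_url(dataset_id, where=None, limit=100, base_url=BASE_URL):
    """Build a Socrata-style query URL (assumed endpoint shape, see lead-in)."""
    params = {"$limit": limit}
    if where:
        params["$where"] = where
    # urlencode percent-escapes spaces, quotes and the "$" prefix.
    return f"{base_url}/resource/{dataset_id}.json?{urlencode(params)}"


# "abcd-1234" and "borough" are hypothetical placeholders, not a real dataset.
url = build_query_url("abcd-1234", where="borough = 'MANHATTAN'", limit=5)
print(url)
```

The resulting URL can be opened in a browser or fetched with any HTTP client; the response would be JSON, which Python's json module can parse directly.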

Collections
Inter-university Consortium for Political and Social Research (ICPSR)
  http://www.icpsr.umich.edu/icpsrweb/ICPSR/
  A rich data archive of over 7,500 titles presented with full documentation, most with data formatted for use in standard statistical packages.
ProQuest Statistical Insight
  https://clio.columbia.edu/catalog/2334507
  Provides statistical data from U.S. government publications from 1973, state and private sources from 1980, and international organizations from 1983.
Historical Statistics of the United States
  https://clio.columbia.edu/catalog/5634151
ProQuest Statistical Abstract of the U.S.
  http://www.columbia.edu/cgi-bin/cul/resolve?clio10126076
DSSC data catalogs
  http://library.columbia.edu/locations/dssc.html
Social Sciences resources
  http://library.columbia.edu/locations/dssc/data/socsc.html

Preliminary meta-research
1. Free-of-charge databases?
2. Fee-based databases?
How do you choose? Who publishes them? Where can you access them?
Open data:
  USA.gov, Data.gov, Data.un.org
  Government agency websites
  Sunlight Foundation
Closed data:
  ICPSR
  Govistics
  Columbia Spatial Data Catalog

Read your question for research clues: Q. 2.  Creating a false appearance of active trading in the market by investors is a domestic and international problem. Can it be regulated?

Clarify your question (break it into smaller concepts) Do you need contextual information? (literature search) Can you go for the primary source, the rule? Where do you start?

More preliminary meta-research: domestic law research
1. Free-of-charge databases?
2. Fee-based databases?
How do you choose? Who publishes them? Where can you access them?
Remember: a mere Google search may give you the right starting point.

Answer – US regulations
Doing the research!
Depending on your research needs, you may or may not skip the literature search (regular law-based databases)
Find gov docs in free-of-charge databases
  Use a library guide
  Gov reports (CRS reports)
  Agency reports and other activities
Use fee-based databases
  Bloomberglaw.com
  Practical Law (from Westlaw)

More preliminary meta-research: foreign and international law research
1. Free-of-charge databases?
2. Fee-based databases?
How do you choose? Who publishes them? Where can you access them?
Remember: a mere Google search may give you the right starting point.

Answer – EU regulations
Doing the research!
Depending on your research needs, you may or may not skip the literature search (regular law-based databases)
The main EU database is free of charge: Europa
  Use a library guide
  Use the database itself
  Find legislation on your topic
The main UN databases are free of charge
  UNCITRAL
Google searches

Questions?
Eric C. Glass
  Email: ecg2104@columbia.edu
Dana Neacsu
  Email: edn13@columbia.edu