[GISCUP2013] Mailing List Q&A + Project Discussion Ashok Dahal.

Slides:



Advertisements
Similar presentations
Testing Relational Database
Advertisements

5&6 November th WISE TG Meeting Vienna1 GIS guidance document for WISE Chapter 4.2 Update of existing datasets (Lead: UBA, Co-Authors: JRC)
Intro to Computer Org. Pipelining, Part 2 – Data hazards + Stalls.
Week 5: Loops 1.  Repetition is the ability to do something over and over again  With repetition in the mix, we can solve practically any problem that.
Programming Types of Testing.
1 Project Smart Metering Portfolio Foundation Updates June 2013.
Rachelle Howell The University of Texas at Austin.
Exercise lecture : Exercise 2 and 3 Rune / Yun. Overview Intro to exercise 3 Aspects from exercise 2.
1 Introduction to Computability Theory Lecture12: Reductions Prof. Amos Israeli.
Tutorial 6 & 7 Symbol Table
CS 106 Introduction to Computer Science I 02 / 28 / 2007 Instructor: Michael Eckmann.
1 Programming & Programming Languages Overview l Machine operations and machine language. l Example of machine language. l Different types of processor.
Need your MyMathLab card with your access code Need a Valid Address Need to know Purdue’s zip code is and your course ID for your Class You.
Programming. Software is made by programmers Computers need all kinds of software, from operating systems to applications People learn how to tell the.
Section 1.3 Prime numbers and fractions
Election Algorithms. Topics r Issues r Detecting Failures r Bully algorithm r Ring algorithm.
Merging census aggregate statistics with postal code-based microdata Laine Ruus University of Toronto. Data Library Service ,
1 Chapter-01 Introduction to Computers and C++ Programming.
Programmer Defined Functions Matthew Verleger. Windows It’s estimated that Window’s XP contains 45 million lines of code (and it’s over 10 years old).
400 7 th Street, SW | Washington | DC w w w. d o t. g o v MARAD Virtual Office of Acquisitions (VOA) NS Savannah Proposal May 24, 2006.
PCSpim How to Program ?. Some Resource There are some useful online document! You can find the links on our TAs’ website. tw/~xdd/Arc06/
Nachos Phase 1 Code -Hints and Comments
GIS CUP 13 – Team Project. Introduction GIS-focused algorithm competition The 2013 contest is about geo-fencing location-based advertisements child location.
Arrays Module 6. Objectives Nature and purpose of an array Using arrays in Java programs Methods with array parameter Methods that return an array Array.
1 Navigating the Letter of Recommendation Section on AMCAS For Medical School Applicants using the Letter of Recommendation Service provided by Lewis Associates.
Introduction To PROLOG World view of imperative languages. World view of relational languages. A PROLOG program. Running a PROLOG program. A PROLOG.
20 Oct - Overview Homework #1 Group-Id rule Notes on Java text file input/output –Scanner class –Printf (like C)
Data Structures and Algorithms Introduction to Algorithms M. B. Fayek CUFE 2006.
Making Decisions uCode: October Review What are the differences between: o BlueJ o Java Computer objects represent some thing or idea in the real.
An Introduction to Programming with C++ Sixth Edition Chapter 7 The Repetition Structure.
CS 4720 Dynamic Web Applications CS 4720 – Web & Mobile Systems.
CMP 131 Introduction to Computer Programming Violetta Cavalli-Sforza Week 3, Lecture 1.
IR Homework #1 By J. H. Wang Mar. 5, Programming Exercise #1: Indexing Goal: to build an index for a text collection using inverted files Input:
Renesas Technology America Inc. 1 M16C Seminars Lab 3 Creating Projects Using HEW4 14 March 2005 M16C Seminars Lab 3 Creating Projects Using HEW4 Last.
TDTIMS Overview What is TDTIMS? & Why Do We Do It?
Chapter 3: Developing Class Methods Object-Oriented Program Development Using Java: A Class-Centered Approach.
JAVA: An Introduction to Problem Solving & Programming, 6 th Ed. By Walter Savitch ISBN © 2012 Pearson Education, Inc., Upper Saddle River,
1 Navigating the Letter of Recommendation Section on AMCAS For Medical School Applicants using the Letter of Recommendation Service provided by Lewis Associates.
Chapter 8 Arrays. A First Book of ANSI C, Fourth Edition2 Introduction Atomic variable: variable whose value cannot be further subdivided into a built-in.
Chapter 5 Linked List by Before you learn Linked List 3 rd level of Data Structures Intermediate Level of Understanding for C++ Please.
Lecture 7 Conditional Scripting and Importing/Exporting.
JAVA: An Introduction to Problem Solving & Programming, 5 th Ed. By Walter Savitch and Frank Carrano. ISBN © 2008 Pearson Education, Inc., Upper.
Arrays Chapter 7. MIS Object Oriented Systems Arrays UTD, SOM 2 Objectives Nature and purpose of an array Using arrays in Java programs Methods.
BIT 115: Introduction To Programming Professor: Dr. Baba Kofi Weusijana (say Doc-tor Way-oo-see-jah-nah, Doc-tor, or Bah-bah)
Machine Learning Chapter 7. Computational Learning Theory Tom M. Mitchell.
CS4432: Database Systems II
Your current Moodle 1.9 Minimum Requirements Ability to do a TEST RUN! Upgrading Moodle to Version 2 By Ramzan Jabbar Doncaster College for the Deaf By.
Virginia Administrative Training Module 1: Processing, Online, Scoring and Reporting Training Presentation Training Presentation Working Within PearsonAccess.
Introduction to Computer Programming Concepts M. Uyguroğlu R. Uyguroğlu.
Component D: Activity D.3: Surveys Department EU Twinning Project.
Arrays Chapter 7.
Advanced Higher Computing Science
Tips for using Java with Internet Explorer
Journal of Mountain Science (JMS)
JavaScript/ App Lab Programming:
Introduction to Computer Science
Objectives In this lesson, you will learn to: Define stacks
Data File Import / Export
First discussion section agenda
Please use speaker notes for additional information!
Tutorial for LightSIDE
Java Programming Loops
Machine Learning: Lecture 3
Programming.
Java Programming Loops
Finding Limits Graphically and Numerically
Arrays.
Programming Assignment Tutorial
Portfolio Information PPT
Running & Testing Programs :: Translators
Presentation transcript:

[GISCUP2013] Mailing List Q&A + Project Discussion Ashok Dahal

Overview Discussion of questions asked by registered members Responses of GISCUP2013 team Discussion of GISCUP updates Project Discussion

Q&A - Deadline Q: When is the submission deadline? (01/17) A: The deadline is August 1st, 2013(01/21)

GISCUP2013 Update Dataset changed(02/27) Summary of changes: – Some Polygon/Point pairs were incorrectly not reported in the result set. – In the new version, we provide the polygons (I.e., stored in poly10.txt. And poly15.txt) sorted by the sequence number in ascending order. From now on, you can assume that all polygons data are given to you sorted that way.

Q&A – Data Size Limit How many points and polygons? Each object(point/polygon) will have number of instances(point/polygon with timestamp) According to problem statement: the maximum number of points and polygons will be no more than 1M and 500 respectively. The question is: that many objects or instances?

Q&A – Data Size Limit contd. Sample data provided has point files with 500 and 1000 points and polygon files with 10 and 15 polygon. Point500.txt = 39,289 lines(instances) Point1000.txt = 69,619 lines(instances) Poly10.txt = 30 lines (instances) Poly15.txt = 40 lines (instances)

Q&A – Data Size Limit [Response] The size limit applies to the number of instances of the points and polygons. That is, the total number of points in the points file will be less than 1M and the total number of polygons in the polygon file will be less than 500. To be more specific, the number of lines in the points file will be less than 1M and the number of lines in the polygon file will be less than 500.

Q&A – Data Size Limit arguments Argument: Determining whether points are in a single polygon which gets redefined 500 times is a much easier problem than having 500 distinct polygons defined at the same time the whole time.  In real world it may not happen. Response: we have to restrict certain dimensions of the problem to make it practical for a contest.

Q&A – Data Size Limit arguments contd. Argument: Do we care how many lines are in the polygon input file or we care how many polygons can be defined at a given time? Response: – The maximum number of polygons that can be defined at a given time is 500. In this case, none of the polygons would move, only the points will move. – The minimum number of polygons that can be defied at a given time is 1. In this case, the polygon can move 499 times.

Q&A – Defining Polygons Question: In the sample files, all of the polygons are defined at once with the first several timestamps, before any points are defined. Will all of the polygons initially defined at the start? Or is it possible new polygons will appear later on? Response: All the initial polygons will be defined before any points are defined as we did in the sample files

Q&A – Defining Polygons[Arguments] Argument: Actually, the sample files do not agree with that statement. The sample file poly15.txt has a Polygon with ID 0 which does not get defined until timestamp , and sample file poly10.txt has a Polygon with ID 0 which does not get defined until Response: This is a data error. We will fix it and redo the test files. There should not be any polygons with an ID less than 1.

Q&A – Evaluation Machine Question: can we assume a Java Runtime Environment installation to be present on the evaluation machine? Response: Yes, you can assume a JDK 1.6 version.

Your Own Questions Do you also want to ask some specific questions? cs/listinfo/GISCup2013 Get to this link and register so that you can ask questions to them. You will also get s when somebody asks question and when GISCUP team responds.

Project Discussion The data set we are going to use for evaluation will be way bigger than the sample files provided in the CUP website. Example – no of lines in points500.txt file can go up to 1M from current 39,289 lines. Similarly, no of lines in poly10.txt can go up to 500 lines from current 30 lines. You need to work on speeding up your program since large dataset can take a lot of time to get processed and validate your output(Why?)

Project Discussion contd. My Experience: – I am using two methods to check a point INSIDE a polygon(Ray-casting and Winding number method). The algorithms are exhaustive which means no speeding up is done yet. – Programming language : PERL. – Speed and accuracy wise, both method seems similar. – It is taking around 800s for using points500.txt(39,289 instances)with poly10.txt(30 instances). – How long will it take for points500.txt(1M instances) and poly10.txt(500 instances)?

Project Discussion contd. So Speeding Up is a MAJOR factor. Accuracy: – All pairs matching. – Initially, I had more than 20,000 pairs in my output which means, I had more than 9600 extra pairs. – If there are extra pairs, your score will go down because each extra pair will decrement the score by 1. – So accuracy is another MAJOR factor. – HINTS for accuracy: Remove all the extra pairs based on the problem definition. i.e. check time stamp of point vs. polygon and check if the polygon is already expired.

Project Discussion contd. Remember that you also have to do, WITHIN not only INSIDE the polygon. Things you need to consider: – Start early!!! – Work on the speed. Apply filtering as discussed in the class. If you can utilize multi core CPU, that is awesome. – Work on accuracy.