1 Some Guidelines for Good Research Dr Leow Wee Kheng Dept. of Computer Science.

Slides:



Advertisements
Similar presentations
Image Retrieval: Current Techniques, Promising Directions, and Open Issues Yong Rui, Thomas Huang and Shih-Fu Chang Published in the Journal of Visual.
Advertisements

CSC321: Introduction to Neural Networks and Machine Learning Lecture 24: Non-linear Support Vector Machines Geoffrey Hinton.
Unsupervised Learning Clustering K-Means. Recall: Key Components of Intelligent Agents Representation Language: Graph, Bayes Nets, Linear functions Inference.
Experiments and Variables
Support Vector Machines
Computer vision: models, learning and inference Chapter 8 Regression.
Using the Crosscutting Concepts As conceptual tools when meeting an unfamiliar problem or phenomenon.
Ranked Retrieval INST 734 Module 3 Doug Oard. Agenda  Ranked retrieval Similarity-based ranking Probability-based ranking.
Computer Vision Group, University of BonnVision Laboratory, Stanford University Abstract This paper empirically compares nine image dissimilarity measures.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Evaluation.  Allan, Ballesteros, Croft, and/or Turtle Types of Evaluation Might evaluate several aspects Evaluation generally comparative –System A vs.
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 4: Modeling Decision Processes Decision Support Systems in the.
Cluster Analysis.  What is Cluster Analysis?  Types of Data in Cluster Analysis  A Categorization of Major Clustering Methods  Partitioning Methods.
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Detecting and Tracking Moving Objects for Video Surveillance Isaac Cohen and Gerard Medioni University of Southern California.
Overview of The Operations Research Modeling Approach.
INFO 624 Week 3 Retrieval System Evaluation
Retrieval Evaluation. Brief Review Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Silicon Prairie Initiative on Robotics in Information Technology
Silicon Prairie Initiative on Robotics in Information Technology
1 CSC 1401 S1 Computer Programming I Hamid Harroud School of Science and Engineering, Akhawayn University
Learning Techniques for Information Retrieval We cover 1.Perceptron algorithm 2.Least mean square algorithm 3.Chapter 5.2 User relevance feedback (pp )
Approaches to Representing and Recognizing Objects Visual Classification CMSC 828J – David Jacobs.
Maximum likelihood (ML)
The Art of Programming Top-Down Design. The Art of Problem Solving The art of problem solving is the transformation of an English description of a problem.
By Saparila Worokinasih
Slide Image Retrieval: A Preliminary Study Guo Min Liew and Min-Yen Kan National University of Singapore Web IR / NLP Group (WING)
Mehdi Ghayoumi Kent State University Computer Science Department Summer 2015 Exposition on Cyber Infrastructure and Big Data.
 1  Outline  stages and topics in simulation  generation of random variates.
 Let A and B be any sets A binary relation R from A to B is a subset of AxB Given an ordered pair (x, y), x is related to y by R iff (x, y) is in R. This.
Purpose of study A high-quality computing education equips pupils to use computational thinking and creativity to understand and change the world. Computing.
11 Writing a Conference Research Paper Miguel A. Labrador Department of Computer Science & Engineering
Shape Matching for Model Alignment 3D Scan Matching and Registration, Part I ICCV 2005 Short Course Michael Kazhdan Johns Hopkins University.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Random Sampling, Point Estimation and Maximum Likelihood.
Introduction CS 3358 Data Structures. What is Computer Science? Computer Science is the study of algorithms, including their  Formal and mathematical.
Standards for Mathematical Practice #1 Make sense of problems and persevere in solving them. I can: explain the meaning of a problem. choose the right.
CS433: Modeling and Simulation Dr. Anis Koubâa Al-Imam Mohammad bin Saud University 15 October 2010 Lecture 05: Statistical Analysis Tools.
1 Lesson 8: Basic Monte Carlo integration We begin the 2 nd phase of our course: Study of general mathematics of MC We begin the 2 nd phase of our course:
Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
1 CS 430 Database Theory Winter 2005 Lecture 16: Inside a DBMS.
Science Fair How To Get Started… (
Major objective of this course is: Design and analysis of modern algorithms Different variants Accuracy Efficiency Comparing efficiencies Motivation thinking.
1 Multiple Classifier Based on Fuzzy C-Means for a Flower Image Retrieval Keita Fukuda, Tetsuya Takiguchi, Yasuo Ariki Graduate School of Engineering,
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Chapter 8 Evaluating Search Engine. Evaluation n Evaluation is key to building effective and efficient search engines  Measurement usually carried out.
Solve Absolute Value Inequalities © 2011 The Enlightened Elephant.
Classification (slides adapted from Rob Schapire) Eran Segal Weizmann Institute.
Data Mining, ICDM '08. Eighth IEEE International Conference on Duy-Dinh Le National Institute of Informatics Hitotsubashi, Chiyoda-ku Tokyo,
Fall 2008Programming Development Techniques 1 Topic 17 Assignment, Local State, and the Environment Model of Evaluation Section 3.1 & 3.2.
Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.
Business Project Nicos Rodosthenous PhD 08/10/2013 1
 In the previews parts we have seen some kind of segmentation method.  In this lecture we will see graph cut, which is a another segmentation method.
Lecture 5: Statistical Methods for Classification CAP 5415: Computer Vision Fall 2006.
Computacion Inteligente Least-Square Methods for System Identification.
May 2003 SUT Color image segmentation – an innovative approach Amin Fazel May 2003 Sharif University of Technology Course Presentation base on a paper.
Relevant Document Distribution Estimation Method for Resource Selection Luo Si and Jamie Callan School of Computer Science Carnegie Mellon University
1 Kernel Machines A relatively new learning methodology (1992) derived from statistical learning theory. Became famous when it gave accuracy comparable.
By: Megan Funk. I will: 1. Explain the binary number system How to: -Generate binary from a number -Add binary 2. Explain the base-b number system 3.
Solve Absolute Value Inequalities © 2011 The Enlightened Elephant.
Big data classification using neural network
CSPs: Search and Arc Consistency Computer Science cpsc322, Lecture 12
Answering ‘Where am I?’ by Nonlinear Least Squares
Project Implementation for ITCS4122
Introduction PCA (Principal Component Analysis) Characteristics:
Cluster Validity For supervised classification we have a variety of measures to evaluate how good our model is Accuracy, precision, recall For cluster.
Image Segmentation.
Multi-Information Based GCPs Selection Method
Retrieval Performance Evaluation - Measures
Presentation transcript:

1 Some Guidelines for Good Research Dr Leow Wee Kheng Dept. of Computer Science

2 Research is...  extending human understanding?  publishing papers?  solving a challenging puzzle?  making things work?  having fun?

3 Research involves...  selecting a topic  studying existing work  solving problems  publishing results

4 Problem Solving  define problem  specify requirements/quality of solution  formulate solution method  evaluate performance of method

5 Problem Definition  Purpose  clarify problem  focus attention  Define problem precisely  It's usually difficult to define problem. So, just skip it?  The more difficult it is, the stronger is the need, the harder one should try.

6 Problem Definition Example: image retrieval 1st trial: Retrieve images that a user wants. Problem: too vague  What does user want?

7 Problem Definition 2nd trial: Retrieve images that are similar to a specific example called the query. Better, but still vague:  What does it mean by “similar”?

8 Problem Definition 3nd trial: Given a query Q, retrieve images I i, i = 1,...,n, such that a similarity measure s(Q, I i ) is large. Better, but  What is this s(Q, I i )? Notice  There’s an input: Q  There’re expected outputs: I i  There’s requirement specification: large s(Q, I i )

9 Problem Definition 4th trial: Given a query Q, retrieve images I i, i = 1,...,n, such that a similarity measure s(Q, I i ) is large, and s(Q, I i ) is consistent with human’s perception. Good try  but difficult to measure human’s perception; still a difficult research topic. Notice  We are talking more about quality of solution.

10 Problem Definition 5th trial: Given a query Q, retrieve images I i, i = 1,...,n, such that precision p and recall rate r are maximized.  each Q and I i contains one or more regions  p and r are performance indices  p and r are defined in terms of similarity s(Q, I i ) Now, the problem definition is more specific:  requirement of solution also given  still haven’t said what are s(Q, I i ), p, r

11 Problem Definition Good problem definition should include  inputs  expected outputs  relationships between inputs and outputs  requirements about outputs  performance measures

12 Formulate Solution Method What it is not...  not writing a program  not thinking of an algorithm  not thinking of what tools to use So what is it?  give more details to problem definition  use mathematics  divide the problem into sub-problems  map sub-problem into known problem with known solution methods

13 Formulate Solution Method Example: image retrieval First sub-problem: define similarity s(Q, I i )  Q has regions R k  I i has regions R ij  some R k are identical/similar to some R ij  some R k are not identical/similar to some R ij  need to find best matching pairs How to say all these in math?

14 Formulate Solution Method  One possibility: use a mapping function:  f: Q  I i  R ij = f(R k ) means R k corresponds to R ij  define region similarity s(R k, R ij )  s(R k, R ij ) is large if R k is similar to R ij  this is another sub-problem  s(R k, f(R k )) = match between R k  Q and corresponding f(R k )  I i

15 Formulate Solution Method First, define s f (Q, I i ): Given a query Q, an image I i, and a mapping function f, define s f (Q, I i ) as Then, define s(Q, I i ): Given a query Q and an image I i,

16 Formulate Solution Method Now, perform a “magic” which I call problem transformation.  Given a query Q and an image I i, find the mapping f such that s(Q, I i ) is maximized.  this is an optimization problem  Given a query Q and a set of images I i,  compute s(Q, I i ) by solving optimization problem  return images I i with large s(Q, I i ). Now, we have an algorithm!

17 Formulate Solution Method Sounds complicated. Do we really do all these?  Yes. The more complicated, the stronger the need. Further reading: my IJCAI 2001 paper on conceptual graph  map query and images to conceptual graphs; define problem as subgraph matching problem (problem transformation)  map subgraph matching to search problem (transform once more)  implement search algorithm

18 Formulate Solution Method Some challenges:  Define the problem that you’re working on using math.  Try to find an interesting CS research problem that cannot be defined using math.

19 Performance Evaluation Purpose  evaluate solution method  understand its strengths and weaknesses (with the aim of improving it)  determine values of system parameters

20 Performance Evaluation  What’s your impression if you see this: The accuracy of the method is 90%.  The statement is not very meaningful.  Is the method really good?  may be the problem is simple  may be the data is not representative  easy to differentiate elephant from apple; difficult to differentiate between African and Asian elephant

21 Performance Evaluation So what makes a good evaluation?  good test data  test various aspects of method  good test cases  expose strengths and weaknesses of method  good comparison  set baselines for assessing performance  compare with state-of-the-art

22 Performance Evaluation Example 1: Color histograms: finding good operating condition (i.e., parameter values)  100 colorful images  2 parameters:  radius R  separation ratio   2 performance measures:  mean number of color bins  mean error of color

23 Performance Evaluation

24 Performance Evaluation

25 Performance Evaluation good operating condition

26 Performance Evaluation Example 2: Estimation of surface normals of 3D points.  3 sets of data points  2 methods of getting good neighboring points: point-based (P), mesh-based (M)  2 methods of computing surface normals: PCA, linear extrapolation (LE)  baseline: PCA on raw 3D points

27 Performance Evaluation What can you conclude from the result?  M is better than P  LE is better than PCA  LE/M is the best

28 Performance Evaluation Example 3: Classification of images by color distributions.  100 classes, 20 samples each  2 methods of computing color histograms: clustered (c), adaptive (a)  4 dissimilarity measures:  L2, JD: only for c  EMD: too slow for c, so only for a  WC: ok for both c and a  baseline: c + L2

29 Performance Evaluation What can you conclude from the result?

30 final points...  Always relate to the big picture.  good research is never performed in isolation  Know the strengths & weaknesses of your tools.  nothing is perfect for everything  Understand the problem first.  choose the solution method last  Learn various methods and tools.  you never know when you need one

31 good readings...  H. S. Fogler & S. E. LeBlanc, Strategies for Creative Problem Solving, Prentice-Hall,  general ideas, no math  D. Huff, How To Lie With Statistics, W. W. Norton,  know how not to get cheated