2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, 2008 1 Frame an IR Research Problem and Form Hypotheses ChengXiang Zhai Department.

Slides:



Advertisements
Similar presentations
ACM SIGIR 2009 Workshop on Redundancy, Diversity, and Interdependent Document Relevance, July 23, 2009, Boston, MA 1 Modeling Diversity in Information.
Advertisements

2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Write and Publish an IR Paper ChengXiang Zhai Department of Computer.
1.Accuracy of Agree/Disagree relation classification. 2.Accuracy of user opinion prediction. 1.Task extraction performance on Bing web search log with.
Querying for Information Integration: How to go from an Imprecise Intent to a Precise Query? Aditya Telang Sharma Chakravarthy, Chengkai Li.
Raymond Martin What is Research? “A STUDIOUS ENQUIRY or examination especially a critical and exhaustive investigation or experimentation.
2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Introduction to IR Research ChengXiang Zhai Department of Computer.
Scientific Research Dr. Noura Al-dayan.
Lecture 2 Page 1 CS 236, Spring 2008 Security Principles and Policies CS 236 On-Line MS Program Networks and Systems Security Peter Reiher Spring, 2008.
IR Challenges and Language Modeling. IR Achievements Search engines  Meta-search  Cross-lingual search  Factoid question answering  Filtering Statistical.
Deanery of Business & Computer Sciences Research Methods Week 1 Collecting, Processing and Analyzing Data.
1 Module 2: Fundamental Concepts Problems Programs –Programming languages.
Lecture 2: Fundamental Concepts
An Overview of Text Mining Rebecca Hwa 4/25/2002 References M. Hearst, “Untangling Text Data Mining,” in the Proceedings of the 37 th Annual Meeting of.
INFO 624 Week 3 Retrieval System Evaluation
Chapter 1 Conducting & Reading Research Baumgartner et al Chapter 1 Nature and Purpose of Research.
Health Informatics Series
PROBLEM FORMULATION Defining a Researchable Problem Research Methods College of Public and Community Service University of Massachusetts at Boston ©2011.
Research problem, Purpose, question
Marketing Research and Information Systems
Introduction to IR Research Methodology
Internet Research Finding Free and Fee-based Obituaries Online.
Dr. Alireza Isfandyari-Moghaddam Department of Library and Information Studies, Islamic Azad University, Hamedan Branch
Temporal Event Map Construction For Event Search Qing Li Department of Computer Science City University of Hong Kong.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
4.04 Understand marketing- research activities to show command of their nature and scope.
2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Pick a Good IR Research Problem ChengXiang Zhai Department of Computer.
Evaluation of digital collections' user interfaces Radovan Vrana Faculty of Humanities and Social Sciences Zagreb, Croatia
Soc 3306a Lecture 4 The Research Report and the Literature Review.
Using a Variety of Technologies to Teach Compute Hardware Background Approach  Quizzes  Web quests  Basic programming  Raspberry Pi Results Conclusions.
2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Prepare Yourself for IR Research ChengXiang Zhai Department of Computer.
2008 © ChengXiang Zhai 1 Introduction to IR Research ChengXiang Zhai Department of Computer Science Graduate School of Library & Information Science Institute.
IR1IMEM YEAR ONE RESEARCH METHODOLOGY L2: Reviewing Literature, Formulating Research Problem, Variables DR. JAVED-VASSILIS KHAN, Drs. Allerd Peters, Frank.
1 Information Filtering & Recommender Systems (Lecture for CS410 Text Info Systems) ChengXiang Zhai Department of Computer Science University of Illinois,
Survey Of Music Information Needs, Uses, and Seeking Behaviors Jin Ha Lee J. Stephen Downie Graduate School of Library and Information Science University.
2008 © ChengXiang Zhai 1 Introduction to Research ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign
 Text Representation & Text Classification for Intelligent Information Retrieval Ning Yu School of Library and Information Science Indiana University.
Problem Definition Chapter 7. Chapter Objectives Learn: –The 8 steps of experienced problem solvers –How to collect and analyze information and data.
Automatically Generating Gene Summaries from Biomedical Literature (To appear in Proceedings of PSB 2006) X. LING, J. JIANG, X. He, Q.~Z. MEI, C.~X. ZHAI,
Institute of Professional Studies School of Research and Graduate Studies Introduction to Business and Management Research Lecture One (1)
Real World IR Challenges (CS598-CXZ Advanced Topics in IR Presentation) Jan. 20, 2005 ChengXiang Zhai Department of Computer Science University of Illinois,
Science Fair How To Get Started… (
Problem solving methodology Information Technology Units Adapted from VCAA Study Design - Information Technology Byron Mitchell, November.
ICT IGCSE.  Introducing or changing a system needs careful planning  Why?
Reviewing the Literature and Developing Research Questions You will be able to: Identify research problems. Explain why it is necessary to conduct a literature.
Toward A Session-Based Search Engine Smitha Sriram, Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Instructore: Tasneem Darwish1 University of Palestine Faculty of Applied Engineering and Urban Planning Software Engineering Department Requirement engineering.
1 The Theoretical Framework. A theoretical framework is similar to the frame of the house. Just as the foundation supports a house, a theoretical framework.
Developing a Research Question Judy Zerzan, MD, MPH July 5, 2005.
Basics of Biostatistics for Health Research Session 1 – February 7 th, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health.
Opportunities for Text Mining in Bioinformatics (CS591-CXZ Text Data Mining Seminar) Dec. 8, 2004 ChengXiang Zhai Department of Computer Science University.
2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, 龙星计划课程 : 信息检索 Course Summary ChengXiang Zhai ( 翟成祥 ) Department of.
Active Feedback in Ad Hoc IR Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.
Research Word has a broad spectrum of meanings –“Research this topic on ….” –“Years of research has produced a new ….”
Chapter 06 Marketing Research and Information Systems Part Three Target Market Selection and Research.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Framing a research question Chitra Grace A Scientist- C (PGDHE) NIE, Chennai RM Workshop for ICMR Scientists 01/11/2011.
Toward Entity Retrieval over Structured and Text Data Mayssam Sayyadian, Azadeh Shakery, AnHai Doan, ChengXiang Zhai Department of Computer Science University.
A Study of Poisson Query Generation Model for Information Retrieval
Survey of Digitization INF 385R - Megan Winget Fridays, INF 385R - Megan Winget Fridays,
1 Prepared by: Laila al-Hasan. 1. Definition of research 2. Characteristics of research 3. Types of research 4. Objectives 5. Inquiry mode 2 Prepared.
Shuang Wu REU-DIMACS, 2010 Mentor: James Abello. Project description Our research project Input: time data recorded from the ‘Name That Cluster’ web page.
Human Computer Interaction Lecture 21 User Support
Introduction to Research
The scope and focus of the Research
Introduction to IR Research
PROBLEM FORMULATION Defining a Researchable Problem Research Methods
4.00 Understand promotion and intermediate uses of marketing-information Understand marketing-research activities to show command of their nature.
Introduction to Research
Course Summary ChengXiang “Cheng” Zhai Department of Computer Science
Presentation transcript:

2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Frame an IR Research Problem and Form Hypotheses ChengXiang Zhai Department of Computer Science Graduate School of Library & Information Science Institute for Genomic Biology, Statistics University of Illinois, Urbana-Champaign

2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, General Steps to Define a Research Problem Generate and Test Raise a question Novelty test: Figure out to what extent we know how to answer the question –There’s already an answer to it: Is the answer good enough? Yes: not interesting, but can you make the question more challenging? No: your research problem is how to get a better answer to the raised question –No obvious answer: you’ve got an interesting problem to work on Tractability test: Figure out whether the raised question can be answered –I can see a way to answer it or potentially answer it: you’ve got a solvable problem –I can’t easily see a way to answer it: Is it because the question is too hard or you’ve not worked hard enough? Try to reframe the problem to make it easier Evaluation test: Can you obtain a data set and define measures to test solutions/answers? –Yes: you’ve got a clearly defined problem to work on –No: can you think of anyway to indirectly test the solutions/answers? Can you reframe the problem to fit the data? Every time you reframe a problem, try to do all the three tests again.

2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Rigorously Define Your Research Problem Exploratory: what is the scope of exploration? What is the goal of exploration? Can you rigorously answer these questions? Descriptive: what does it look like? How does it work? Can you formally define a principle? Evaluative: can you clearly state the assumptions about data collection? Can you rigorously define measures? Explanatory: how can you rigorously verify a cause? Predictive: can you rigorously define what prediction is to be made?

2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Frame a New Computation Task Define basic concepts Specify the input Specify the output Specify any preferences or constraints

2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Map of IR Applications Web pages News articles messages Literature Organization docs Legal docs/Patents Medical records Customer complaint letter/transcripts … Kids Peking Univ. community LawyersScientists SearchBrowsingAlertMining Task/Decision support Customer Service People management + automatic reply “Google Kids” Legal Info Systems Literature Assistant Intranet Search Local Web Service Blog articles Online Shoppers ?

2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, From a new application to a clearly defined research problem Try to picture a new system, thus clarify what new functionality is to be provided and what benefit you’ll bring to a user Among all the system modules, which are easy to build and which are challenging? Pick a challenge and try to formalize the challenge –What exactly would be the input? –What exactly would be the output? Is this challenge really a new challenge (not immediately clear how to solve it)? –Yes, your research problem is how to solve this new problem –No, it can be reduced to some known challenge: are existing methods sufficient? Yes, not a good problem to work on No, your research problem is how to extend/adapt existing methods to solve your new challenge Tuning the problem

2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Tuning the Problem Level of Challenges Impact/Usefulness Known Unknown Make a hard problem easier Make an easy problem harder Increase impact (more general)

2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Examples of Problem Formulation Risk minimization framework Study of smoothing Axiomatic retrieval framework Comparative Text Mining Contextual PLSA Opinion Integration

2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Form Research Hypotheses Typical hypotheses in IR: –Hypothesis about user characteristics (tested with user studies or user- log analysis, e.g., clickthrough bias) –Hypothesis about data characteristics (tested with fitting actual data, e.g., Zipf’s law) –Hypothesis about methods (tested with experiments): Method A works (or doesn’t work) for task B under condition C by measure D (feasibility) Method A performs better than method A’ for task B under condition C by measure D (comparative) Introduce baselines naturally lead to hypotheses Carefully study existing literature to figure our where exactly you can make a new contribution (what do you want others to cite your work as?) The more specialized a hypothesis is, the more likely it’s new, but a narrow hypothesis has lower impact than a general one, so try to generalize as much as you can to increase impact But avoid over-generalize (must be supported by your experiments) Tuning hypotheses (next lecture)

2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Next Lecture (June 26): Test/Refine Hypothese