Contraceptive Method Choice 指導教授黃三益博士組員 :B924020007 王俐文 B924020009 謝孟凌 B924020014 陳怡珺.

Slides:

Advertisements

Similar presentations

Historical Changes in Stay-at-Home Mothers: 1969 to 2009 American Sociological Association Annual Meeting Atlanta, GA August 14-17, 2010 Rose M. Kreider,

Advertisements

Significance Testing.  A statistical method that uses sample data to evaluate a hypothesis about a population  1. State a hypothesis  2. Use the hypothesis.

String Searching Algorithm

Final Project- Mining Mushroom World. Agenda Motivation and Background Determine the Data Set (2) 10 DM Methodology steps (19) Conclusion.

Early Marriage in Egypt: Field Research El Nadeem Center 18- June

Web-based Spoken English Training System with American Accent 線上美式英文口音訓練系統指導教授：陳恆佑老師學生：王舜霈 Date: June 25th, 2008 國立暨南國際大學資訊工程學系碩士班畢業成果展 1.

Machine Learning in Practice Lecture 3 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.

The Influence of Perceptions of Community Norms on Current Contraceptive Use among Men and Women in Ethiopia and Kenya Michelle Dynes 1. Rob Stephenson.

Introduction to Data Mining with XLMiner

A Classification Approach for Effective Noninvasive Diagnosis of Coronary Artery Disease Advisor: 黃三益教授 Student: 李建祥 D 楊宗憲 D 張珀銀 D

Spatial and Temporal Data Mining V. Megalooikonomou Introduction to Decision Trees ( based on notes by Jiawei Han and Micheline Kamber and on notes by.

1 Predicting the winner of C.Y. award 指導教授：黃三益博士組員：尹川陳隆賢陳偉聖.

ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.

SINGLE VARIABLE DATA DEFINITIONS ETC. GENERAL STUFF STATISTICS IS THE PROCESS OF GATHERING, DISPLAYING, AND ANALYZING DATA. DATA CAN BE GATHERED BY CONDUCTING.

SOCIODEMOGRAPHIC VARIABLES IN PROFILING WELLNESS TOURISTS Ana Težak Damijanić Pavlo Ružić Institute of Agriculture and Tourism.

指導教授：黃三益教授學生： M 陳聖現 M 王啟樵 M 呂佳如.

Dr. Donald Kraft (Chair)

1 Immigrant Economic and Social Integration in Canada: Research, Measurement, Data Development By Garnett Picot Director General Analysis Branch Statistics.

1 Chapter Seven Introduction to Sampling Distributions Section 1 Sampling Distribution.

Cascaded Integrator Comb Filter 長庚電機通訊組碩一張晉銓指導教授 : 黃文傑博士.

An Overview of Statistical Inference – Learning from Data

Repeat Pregnancy in HIV Positive Indian Women Nishi Suryavanshi 1 Ashwini Erande 1, Hemlata Pisal 1, Anita V. Shankar 2, Robert C. Bollinger 3, Mrudula.

Workshop on the Improvement of Civil Registration and Vital Statistics in the SADC Region, Blantyre, Malawi, 1 – 5 December 2008 Vital statistics and their.

DATA MINING FINAL REPORT Vipin Saini M 許博淞 M 陳昀志 M

Photographic surveying of minority carrier diffusion length in polycrystalline silicon solar cells by electroluminescence 指導老師：林克默博士黃文勇博士學生：郭怡彣.

Chapter 6 Lecture 3 Sections: 6.4 – 6.5.

 Mail Order Company in USA › Would like to find out if there is a way › To reduce mailing cost › By analyzing the past data.

Designing Survey Instruments. Creating a Survey Instrument  Survey instruments should help researchers collect the most accurate data and reach the most.

Male Method Choice in Bangladesh: Does It Matter Who Makes The Decision? Mohammad Amirul Islam Sabu S. Padmadas Peter W.F. Smith Division of Social Statistics.

Analysis and Write up Inter-agency Child Protection Working Group & Save the Children Picture by: Lindsay Stark Training material developed by: Hani Mansourian.

Introduction of Data Prepared by: Bhakti Joshi Date: November 11, 2011.

Will how tall you are tell us what size shoe you wear?

1 1 Anonymised Integrated Event History Datasets for Researchers Johan Heldal Statistics Norway.

Ch11. Team Members 指導教授 : 郭育仁組員 : M 蘇文呈 M 吳孟珊 M 陳苾鈐 M 蔡念倢.

Chapter 1 Statistics by Mohamed ELhusseiny

Chapter 3 Data Mining Methodology and Best Practices

Software Defined Radios 長庚電機通訊組碩一張晉銓指導教授 : 黃文傑博士.

Consistency in reporting contraception between spouses in Bangladesh: evidence from recent demographic and health survey Mohammad Amirul Islam Sabu S.

Surveillance and Surveys Higher Blood pressure among Inuit migrants in Denmark than among Inuit in Greenland Bjerregaard et al.

AP Statistics: ANOVA Section 1. In section 13.1 A, we used a t-test to compare the means between two groups. An ANOVA (ANalysis Of VAriance) test is used.

2002 Spring Data Mining Term Project Proposal Data Mining Experimental Study with Oracle & MS SQL 012ITI12 Song Mi-Kyoung.

1 1 Topics difficult to measure in a register-based census Harald Utne Census Project Statistics Norway UNECE-Eurostat Meeting on Population.

CIS671-Knowledge Discovery and Data Mining Vasileios Megalooikonomou Dept. of Computer and Information Sciences Temple University AI reminders (based on.

Konstantina Christakopoulou Liang Zeng Group G21

Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.

Software Defined Radios 長庚電機通訊組碩一張晉銓指導教授 : 黃文傑博士.

Using Classification Trees to Decide News Popularity

TUVALU DEMOGRAPHIC AND HEALTH SURVEY OUTLINE  Background  Questionnaire  Sensitive questions  Training  Indicators.

Review of Statistical Terms Population Sample Parameter Statistic.

7.2 Means and Variances of Random Variables, cont.

Collection and Preservation Management. Surveys  Overall Preservation Survey  ID Global Concerns  Ideal vs. Reality  Collections Survey  All  Subset.

1 Determinants of women's autonomy over sexual behaviors within marital relationships in contemporary Vietnam Hongyun Fu, MA Mai Do, MD, DrPH Lung Duy.

Prominent in English Teaching for Taiwan EFL Learning 指導教授 : 鍾榮富高師大博士生范春銀.

INTER-SPOUSE COMMUNICATION AND CONTRACEPTIVE BEHAVIOR IN CAMEROON: A COUPLE-BASED ANALYSIS MBELLA MBELLA Cédric Stéphane Ministry of Economy, Planning.

Which socio-demographic living arrangement helps to reach 100? Michel POULAIN & Anne HERM Orlando 8 January 2014.

Data Mining: Data Prepossessing What is to be done before we get to Data Mining?

論文作者：林三賢教授廖振程博士報告者：吳志偉指導老師：陳正宗終身特聘教授

Chapter Six Normal Curves and Sampling Probability Distributions

JUS 510 Education for Service/tutorialrank.com

The 2006 Survey of Violence against Women and Children in Indonesia

The European Statistical Training Programme (ESTP)

Optimization of Wireless Station Time Slot Allocation with Consideration of Throughput and Delay Constraints 指導教授：林永松博士研究生：林岦毅.

بسمه تعالی کارگاه ارزشیابی پیشرفت تحصیلی

(Checking vital event Iran)

MGSE7.SP.3/MGSE7.SP.4: I can use measure of center and measures of variability for numerical data from random samples to draw informal comparative inferences.

Recommended Population and Housing Census Topics 2020

Chapter 5: The analysis of nonresponse

Presentation transcript:

Contraceptive Method Choice 指導教授黃三益博士組員 :B 王俐文 B 謝孟凌 B 陳怡珺

Background and Motivation Population of the world increases tremendously, people of present day pay more attention to contraceptive method.

Step one: Translate the Business Problem into a Data Mining Problem Topic: Contraceptive Method Choice Predict the current contraceptive method choice (no use, long-term methods, or short-term methods) of a woman based on her demographic and socio- economic characteristics. Especially what kind of couples would chose long- term method.

Step two: Select Appropriate Data Title: Contraceptive Method Choice Sources:  Origin: Subset of the 1987 National Indonesia Contraceptive Prevalence Survey  Creator: Tjen-Sien Lim  Date: June 7, 1997

Step two: Select Appropriate Data Number of Instances: 1473 There is no missing value in this dataset.

Step two: Select Appropriate Data Number of attributes: 10 (including the class attribute) Wife's age Wife's education Husband's education Number of children ever born Wife's religion Wife's now working? Husband's occupation Standard-of-living index Media exposure Contraceptive method used (class attribute)

Step three: Get to Know the Data Attribute Information Attribute NameAttribute TypeDescription of Attribute Value Contraceptive method used class attribute1=No-use 2=Long-term 3=Short-term

Step three: Get to Know the Data Attribute Information Attribute NameAttribute TypeDescription of Attribute Value Wife's ageNumerical

Step three: Get to Know the Data Attribute Information Attribute NameAttribute TypeDescription of Attribute Value Wife's educationCategorical1=low 2, 3, 4=high

Step three: Get to Know the Data Attribute Information Attribute NameAttribute TypeDescription of Attribute Value Husband's educationCategorical1=low 2, 3, 4=high

Step three: Get to Know the Data Attribute Information Attribute NameAttribute TypeDescription of Attribute Value Number of children ever born Numerical

Step three: Get to Know the Data Attribute Information Attribute NameAttribute TypeDescription of Attribute Value Wife's religionBinary0=Non-Islam 1=Islam

Step three: Get to Know the Data Attribute Information Attribute NameAttribute TypeDescription of Attribute Value Wife's now working?Binary0=Yes 1=No

Step three: Get to Know the Data Attribute Information Attribute NameAttribute TypeDescription of Attribute Value Husband's occupationCategorical1, 2, 3, 4

Step three: Get to Know the Data Attribute Information Attribute NameAttribute TypeDescription of Attribute Value Standard-of-living indexCategorical1=low 2, 3, 4=high

Step three: Get to Know the Data Attribute Information Attribute NameAttribute TypeDescription of Attribute Value Media exposureBinary0=Good 1=Not good

Step Four : Create a Model Set Raw Data

Step Four : Create a Model Set Total 1473 samples 75% of the data as training set the rest of the data as testing set →By random sampling Rapid Miner

Step Five: Fix Problems with the Data No missing value Skewed distributions

Step Six : Transform Data to Bring Information to the Surface most of the values of the attribute named Media Exposure are “Good” the numeric variables to do the statistical analysis to finding outliers

Step7 Build Model By RapidMiner, build it with Decision Tree

Step7 Build Model(con’t)

Ripper Rule if wife_age > 30 and Num_children_born <= 1 then 1 (53 / 1 / 3) if Num_children_born <= 0 then 1 (36 / 0 / 0) if Wife_education = 4 and wife_age 3 then 2 (0 / 14 / 0) if Wife_education = 1 and Husband_occupation = 2 then 1 (17 / 0 / 1) if Wife_education = 4 and wife_age > 33 and Num_children_born > 2 and Husband_occupation = 1 and Num_children_born <= 3 then 2 (1 / 10 / 2) Step7 Build Model(con’t)

if Num_children_born > 2 and wife_age 28 then 3 (1 / 0 / 13) if wife_age 4 and Media_exposure = 0 then 3 (1 / 2 / 12) if Husband_education = 4 and wife_age 37 then 2 (0 / 5 / 0) else 1 (305 / 168 / 281) Step7 Build Model(con’t)

Weka-JRip (Wife_education = 4) and (Num_children_born >= 3) and (wife_age >= 35) => method_used=2 (178.0/76.0) (wife_age = 3) => method_used=3 (271.0/120.0) (wife_age = 1) and (wife_age method_used=3 (106.0/51.0) => method_used=1 (771.0/342.0) Step7 Build Model(con’t)

Step 8 Assess Model Decision Tree

Step 8 Assess Model(con’t) Ripper Rule

Step 8 Assess Model(con’t) JRip Rule

Conclusion Result The problems we should improve  more data  ignore some attributes  details of the attribute are not so clear  period and environment have changed

Thanks for you listening…