

The WinMine Toolkit Max Chickering

Build Statistical Models From Data
Dependency Networks
Bayesian Networks
Local Distributions
  - Trees
      Multinomial / Binary Multinomial
      Gaussian / Binary Gaussian
      Log Gaussian / Binary Log Gaussian
  - Complete Tables

Data Processing Tools
DataConverter.exe (Interactive)
  Convert raw text or SQL data into XML format
DataCheck.exe (Command-line)
  Extract basic statistics from data
DataJoin.exe (Command-line)
  Perform a join between two datasets
DataSplit.exe (Command-line)
  Split data into train/test
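The slide does not say how DataSplit.exe decides which cases go to the training set and which to the test set. As an illustration of the general idea only, the following Python sketch performs a seeded random split. The function name split_rows, the train_fraction parameter, and the 70/30 default are assumptions made for this example and are not taken from WinMine.

```python
import random


def split_rows(rows, train_fraction=0.7, seed=0):
    """Shuffle a copy of rows and split it into (train, test).

    Hypothetical helper: this is a generic random split, not the
    algorithm used by DataSplit.exe, which the slides do not describe.
    """
    if not 0.0 <= train_fraction <= 1.0:
        raise ValueError("train_fraction must be between 0 and 1")
    shuffled = list(rows)                      # leave the caller's data untouched
    random.Random(seed).shuffle(shuffled)      # fixed seed gives a repeatable split
    cut = int(round(len(shuffled) * train_fraction))
    return shuffled[:cut], shuffled[cut:]


# Made-up data: ten cases identified only by an id.
rows = [{"id": i} for i in range(10)]
train, test = split_rows(rows)
print(len(train), len(test))
```

Fixing the seed keeps the split reproducible, which matters when a model built on the training part is later scored on the test part.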

Modeling Tools
PlanEditor.exe (Interactive)
  Specify roles (e.g. input vs. output) and distributions for variables
Dnet.exe (Command-line)
  Build a dependency network or Bayesian network from data
DnetBrowser.exe (Interactive)
  Interactively browse a dependency network or Bayesian network
DnetLogscore.exe (Command-line)
  Evaluate prediction accuracy of models
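The slides do not define the score that DnetLogscore.exe reports. A common way to measure the prediction accuracy of a probabilistic model is the average log probability it assigns to the values actually observed in held-out data, and the sketch below computes that quantity. The function average_log_score, the base-2 logarithm, and the example distributions are assumptions for illustration, not WinMine's documented behavior.

```python
import math


def average_log_score(predicted, observed):
    """Mean log2 probability assigned to each observed value.

    predicted: list of dicts mapping a value to its predicted probability
    observed:  list of the values that actually occurred, in the same order
    Hypothetical helper; the exact formula used by DnetLogscore.exe is
    not given in the slides.
    """
    if len(predicted) != len(observed) or not observed:
        raise ValueError("need equally long, non-empty inputs")
    total = 0.0
    for dist, value in zip(predicted, observed):
        p = dist.get(value, 0.0)
        if p <= 0.0:
            return float("-inf")   # the model ruled out something that happened
        total += math.log2(p)
    return total / len(observed)


# Made-up predictions for two test cases of a yes/no variable.
predicted = [{"yes": 0.5, "no": 0.5}, {"yes": 0.25, "no": 0.75}]
observed = ["yes", "yes"]
score = average_log_score(predicted, observed)
print(score)
```

Scores closer to zero mean the model put more probability on what actually happened.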

Built-In Help: -help Option
c:\WinMine Toolkit\Bin>datacheck -help
This executable parses a data file and prints out summary statistics. If a marginal statistics file is provided with the '-marg' flag, the executable collects marginal counts for each variable and prints them to that file.

Built-In Help: No Arguments
c:\WinMine Toolkit\Bin>datacheck
Error in command line: required argument '-data' not given
syntax for datacheck:
Flag     Type    Description               Optional?  Default
-data    string  Data file                 no
-report  string  Report file               yes
-marg    string  Marginal counts file      yes
-silent  bool    Suppress progress output  yes        false
-help    bool    Display help              yes        false
c:\WinMine Toolkit\Bin>
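The syntax listing above is enough to assemble a datacheck command line from a script. The Python sketch below uses only the flags shown in that listing. It assumes that a bool flag is switched on by its presence alone, as -help is in the earlier slide, and the file names are made up for the example. The helper datacheck_command is hypothetical and only builds the argument list without running anything.

```python
def datacheck_command(data, report=None, marg=None, silent=False, exe="datacheck"):
    """Build an argument list for datacheck from the flags in its syntax listing.

    Assumption: a bool flag such as -silent is enabled by passing it
    with no value; the slides do not show it in use.
    """
    cmd = [exe, "-data", data]          # -data is the only required argument
    if report is not None:
        cmd += ["-report", report]      # optional report file
    if marg is not None:
        cmd += ["-marg", marg]          # optional marginal counts file
    if silent:
        cmd.append("-silent")           # suppress progress output
    return cmd


# Hypothetical file names, for illustration only.
cmd = datacheck_command("mydata.xml", marg="marginals.txt", silent=True)
print(" ".join(cmd))

# To actually run it on a machine with the toolkit installed, one option is:
#   import subprocess
#   subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids quoting problems with the space in the c:\WinMine Toolkit\Bin install path.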

Interactive Mode
c:\WinMine Toolkit\Bin>datacheck -gui

WinMine Home Page:
Download/Update tools
  No registry changes: simply copies executables
Online Tutorial
  Steps through using all of the tools with a simple example
Discussion Group