Statistical Machine Translation

Slides:



Advertisements
Similar presentations
Languages for IT & CS Pseudo-code What HTML isn’t Early history Compiling & interpreting Classifying languages The process of programming.
Advertisements

LECTURE 1 CMSC 201. Overview Goal: Problem solving and algorithm development. Learn to program in Python. Algorithm - a set of unambiguous and ordered.
CS Section 600 CS Section 002 Dr. Angela Guercio Spring 2010.
CS211 Data Structures Sami Rollins Fall 2004.
Chapter 16 Programming and Languages: Telling the Computer What to Do.
Compilers and Interpreters. Translation to machine language Every high level language needs to be translated to machine code There are different ways.
Random numbers in Python Nobody knows what’s next...
CS 101 Problem Solving and Structured Programming in C Sami Rollins Spring 2003.
Computer Systems Nat 4/5 Computing Science Translator Programs.
Running Record Analysis Michelle Allan 9/9/14. Running Records provide an assessment of text reading. What…… They are designed to be taken as a child.
Machine Translators By: Holly Slemp. What Do They Do? Translate words from one language to another You can speak and translate words into another language.
COURSE OVERVIEW ADVANCED TEXT ANALYTICS Thomas Tiahrt, MA, PhD CSC492 – Advanced Text Analytics.
Introduction to Computer Programming Computer Programming I Introduction to Aerospace Created by The North Carolina School of Science and Math.The North.
Software Development Concepts ITEC Software Development Software Development refers to all that is involved between the conception of the desired.
Vocabulary List Builder Highlight words or enter words manually for creation of graphic organizer Creates a Word document modifiable with columns: o vocabulary.
Common Core State Standards Parent & Student Toolkit.
Algorithmic Problem Solving CMSC 201 Adapted from slides by Marie desJardins (Spring 2015 Prof Chang version)
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 7. Using Probability Theory to Produce Sampling Distributions.
Systems Development Lifecycle Testing and Documentation.
Model Oriented Programming: An Empirical Study of Comprehension Omar Badreddin Andrew Forward Timothy C. Lethbridge try.umple.org.
Problem Solving using the Science of Computing MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way.
IXA 1234 : C++ PROGRAMMING CHAPTER 1. PROGRAMMING LANGUAGE Programming language is a computer program that can solve certain problem / task Keyword: Computer.
An Investigation of Statistical Machine Translation (Spanish to English) Raghav Bashyal.
Intermediate 2 Computing Unit 2 - Software Development Topic 2 - Software Development Languages and Environments.
Statistical Machine Translation Raghav Bashyal. Statistical Machine Translation Uses pre-translated text (copora) Compare translated text to original.
8-3: Probability and Probability Distributions English Casbarro Unit 8.
Naïve Bayes Classifier Christina Wallin, Period 3 Computer Systems Research Lab Christina Wallin, Period 3 Computer Systems Research Lab
CSC 1010 Programming for All Lecture 2 Introduction to Python Some material based on material from Marty Stepp, Instructor, University of Washington.
The 30 th of October. “The Future Simple Tense”. Maths Christmas Art Spanish English to wear uniform science language Literature to study New Year.
Naïve Bayes Classification Christina Wallin Computer Systems Research Lab
Part-of-Speech Tagging with Limited Training Corpora Robert Staubs Period 1.
Implementation of Steganographic Techniques Danny Friedheim pd. 2.
Python 1 SIGCS 1 Intro to Python March 7, 2012 Presented by Pamela A Moore & Zenia C Bahorski 1.
Programming Objectives What is a programming language? Difference between source code and machine code What is python? – Where to get it from – How to.
Software Engineering Algorithms, Compilers, & Lifecycle.
Machine Language Computer languages cannot be directly interpreted by the computer – they are not in binary. All commands need to be translated into binary.
Zohreh Raghebi.  A software platform provides an integrated environment  Machine learning  Data mining  Text mining  Predictive analytics  Business.
Programming Language Basics. What is a Programming Language? “A computer, human-created language used to write instructions for a computer.” “An artificial.
Component 1.6.
Writing algorithms Introduction to Python.
Introduction to Computers and C++ Programming
The Art of Programming --
FIGURES FOR CHAPTER 1 GETTING STARTED
Internationalization
Introduction to Compiler Construction
Chapter 5- Assembling , Linking, and Executing Programs
Tools for Natural Language Processing Applications
Pamela Moore & Zenia Bahorski
Algorithm and Ambiguity
Model Governance Industry Evolution Beyond Model Accuracy
A451 Theory – 7 Programming 7A, B - Algorithms.
Transition to Code Upsorn Praphamontripong CS 1110
Natural Language Processing (NLP)
Internationalization
Algorithm and Ambiguity
and Executing Programs
High Level Programming Languages
Automated MS Word and PowerPoint Translator
LING 388: Computers and Language
Programming.
Programming Language Basics
Command Me Specification
Statistical n-gram David ling.
CS 1111 Introduction to Programming Spring 2019
Nat 4/5 Computing Science Translator Programs
Natural Language Processing (NLP)
C021TV-I3-S2.
Programming Fundamentals Lecture #2 Overview of Computer Programming
What is Coding? Code The instructions in a computer program.
Natural Language Processing (NLP)
Presentation transcript:

Statistical Machine Translation Raghav Bashyal

Statistical Machine Translation Uses pre-translated text (copora) Compare translated text to original Notice patterns, associate words

Project Translate basic text from Spanish to English Test effectiveness with/without hard-coded components (syntax) Specific procedures/algorithms that add speed

Literature Guides on Statistical Machine Translation NLTK UC Berkeley Christina Wallin UC Berkeley Modifications Larger text more useful “state of the art” implementation Google

Procedure NLTK – Natural Language ToolKit Python Made from Natural Language processing projects Current procedure – read the NLTK book Jump off and write relevant code

Expected Results Probably will be very basic translation Usually perform better with “sample” text than “real” text Highlighted errors Program should use reference data to find some errors Error frequency plots for certain words Test the effectiveness of adjustments Hard coding, other algorithms