Building up Corpus of Technical Vocabulary – Strategies and Feasibility Presenters: Dr. Aparna Palle, Preetha Anthony GNITS, HYDERABAD.

An overview of the presentation: Introduction; Theoretical premise; Interfacing of ESP and Corpora; Criteria for selection of words; Web Tools; The Corpus; Classroom techniques; Conclusion

What is Corpus? Corpora or corpuses are simply large collections or databases of language, incorporating stretches of discourse ranging from a few words to entire books. (Norbert Schmitt, 2000). A corpus is a collection of naturally occurring texts that is usually stored on a computer. (Randi Reppen, 2011). A corpus is a large collection or database of machine-readable texts involving natural discourse in diverse contexts. (Bernardini, 2000)

Definition A Corpus is an inventory of essential language inputs drawn from authentic contexts using web tools.

Why Corpus? Emphasis on the specific needs of learners on professional courses. Learners have a limited vocabulary for performing academic tasks and lack knowledge of specialised vocabulary. Corpus data provide descriptive insights into how people use language. A corpus acts as a tool that enables students and instructors to analyse both how people use different language forms at various levels of formality and how language fulfils multiple speech functions across contexts.

Why Corpus? (contd.) Learning activities centred on analysing corpus data are consistent with current principles of language-learning theory: students develop more autonomy when they receive guidance about how to observe language and make generalizations. Such activities promote noticing and grammatical consciousness raising (Schmidt 1990), which can enhance second language learning and development.

Word-building criteria: Frequency and range; Keyword in context; Collocation; Homonymy; Word families; Idioms and set expressions; etc.
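The first two criteria, frequency and range and keyword in context, lend themselves to a small illustration. The following is a minimal sketch, not the presenters' procedure: the helper names (`tokenize`, `frequency_and_range`, `kwic`) and the three sample sections are hypothetical, and "range" is taken here to mean the number of texts in which a word occurs.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase a text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def frequency_and_range(texts):
    """Return (frequency, text_range) counters over a list of texts.

    frequency[w]  = total occurrences of w across all texts
    text_range[w] = number of texts in which w occurs at least once
    """
    frequency, text_range = Counter(), Counter()
    for text in texts:
        tokens = tokenize(text)
        frequency.update(tokens)
        text_range.update(set(tokens))
    return frequency, text_range


def kwic(text, keyword, width=3):
    """List each occurrence of keyword with up to `width` words on each side."""
    tokens = tokenize(text)
    lines = []
    for i, token in enumerate(tokens):
        if token == keyword:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append(f"{left} [{token}] {right}".strip())
    return lines


# Hypothetical sample sections, standing in for chapters of a source text.
sections = [
    "A variable is a named location in memory. Each variable has a type.",
    "A loop repeats a statement while the loop condition is true.",
    "The compiler reports an error when a variable is not declared.",
]

frequency, text_range = frequency_and_range(sections)
print(frequency["variable"], text_range["variable"])  # 3 2
print(kwic(sections[1], "loop", width=2))
# ['a [loop] repeats a', 'while the [loop] condition is']
```

A word with high frequency but a range of one is concentrated in a single section, which is one reason to look at both figures before adding it to a word list.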

Web tools: AWL Highlighter; British National Corpus (BNC); Collins Cobuild Corpus Concordance Sampler; Compleat Lexical Tutor; Corpus.BYU.edu; Corpus of Contemporary American English (COCA); WordSmith. Source: Materials Development in Language Teaching, ed. by Brian Tomlinson (1998)

AWL Highlighter

Corpus of Computer Programming Word List (CCPWL)

Source from which the Corpus was extracted: “C: The Complete Reference” by Herbert Schildt

Distinguishing Technical Vocabulary (Computing) from others
Category 1 (Purely technical): The word form appears rarely, if at all, outside this particular field. Examples: debug, operand, recompile, loop.
Category 2 (Homonyms, specialised): The word form is used both inside and outside this particular field, but not with the same meaning. Examples: characters, flag, error, default, constants.
Category 3 (Homonyms, general): The word form is used both inside and outside this particular field, but the majority of its uses with a particular meaning, though not all, are in this field. The specialised meaning it has in this field is readily accessible through its meaning outside the field. Examples: variable, parameter, input, output, prefix, code.
Category 4 (Literal meaning): The word form is more common in this field than elsewhere. There is little or no specialisation of meaning, though someone knowledgeable in the field would have a more precise idea of its meaning. Examples: manuals, memory, application, functions.
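The categories above are judged largely on meaning, which needs a human rater. The one measurable element, whether a word form is "more common in this field than elsewhere", can be approximated by comparing relative frequencies in a specialised text and a general text. This is a minimal sketch under that assumption; the function names, the `floor` parameter, and both sample texts are hypothetical.

```python
import re
from collections import Counter


def relative_frequencies(text):
    """Map each word to its share of all tokens in the text."""
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(tokens)
    total = len(tokens)
    return {word: count / total for word, count in counts.items()}


def frequency_ratio(word, specialised, general, floor=1e-6):
    """Relative frequency in the specialised text divided by that in the general text.

    `floor` (an assumed smoothing value) avoids division by zero when the
    word never occurs in the general text.
    """
    spec = relative_frequencies(specialised).get(word, 0.0)
    gen = relative_frequencies(general).get(word, 0.0)
    return spec / max(gen, floor)


# Hypothetical twelve-word samples; real corpora would be far larger.
specialised_text = "the operand is stored in memory and the loop reads memory again"
general_text = "she has a good memory and the book is on the table"

for word in ("operand", "memory", "the"):
    print(word, round(frequency_ratio(word, specialised_text, general_text), 2))
```

In this toy data, "operand" never occurs in the general text and gets a very large ratio, "memory" is twice as common in the specialised text, and "the" is equally common in both. The ratio says nothing about whether the meaning differs between fields, so it cannot separate Categories 2 and 3 on its own.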

Filling Word Parts
Columns: Noun | Verb | Adjective | Adverb
Words given: Compatibility, Programme, Incremental, variously

Cutting up complex words
Meanings of the prefixes: Re – again; En – also; De – down, away, completely, removal, reversal
Words: (1) Decode (2) Encode (3) Debugging (4) Relocation
Meanings:
(a) a methodical process of finding and reducing the number of defects in a computer program or a piece of electronic hardware
(b) the process of assigning load addresses to various parts of a program and adjusting the code and data in the program to reflect the assigned addresses
(c) the process of putting a sequence of characters (letters, numbers, punctuation, and certain symbols) into a specialized format for efficient transmission or storage
(d) the conversion of an encoded format back into the original sequence of characters
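Cutting a word into prefix and stem can be sketched in a few lines. The prefix glosses below are copied from the slide as given; the stem list `KNOWN_STEMS` and the function `cut_word` are hypothetical, and the stem check is only a simple guard against false splits such as "read".

```python
# Prefix glosses copied from the slide as given.
PREFIX_GLOSSES = {
    "re": "again",
    "en": "also",
    "de": "down, away, completely, removal, reversal",
}

# Hypothetical stem list; a real activity would draw stems from the corpus.
KNOWN_STEMS = {"code", "bugging", "location", "compile"}


def cut_word(word):
    """Return (prefix, stem, gloss) if the word is a known prefix plus a known stem, else None."""
    word = word.lower()
    for prefix, gloss in PREFIX_GLOSSES.items():
        stem = word[len(prefix):]
        if word.startswith(prefix) and stem in KNOWN_STEMS:
            return prefix, stem, gloss
    return None


for word in ("Decode", "Encode", "Debugging", "Relocation", "read"):
    print(word, "->", cut_word(word))
```

Requiring a known stem keeps "read" from being cut into "re" + "ad", at the cost of maintaining the stem list by hand.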

Choosing the Correct Form Learning C is similar and ____ (easy). Instead of straight-away ______ (learn) how to write programs, we must first know what alphabets, numbers and special symbols are ____ (use) in C, then how _____ (use) them constants, variables and keywords are _____ (construct), and _____ (final) how are these _____ (combine) to form an _____ (instruct).

Strengthening the Form – Meaning Connection
Words: (1) Manual (2) Syntax (3) Default
Definitions:
(a) a value automatically assigned
(b) a well-structured collection of information for reference
(c) the set of rules that defines the combinations of symbols
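If the words and definitions on this slide are read as a matching task, a learner's attempt can be checked against an answer key. This is a minimal sketch: the pairings in `ANSWER_KEY` are an assumption based on the usual meanings of the three words, and `score_matches` is a hypothetical helper.

```python
# Assumed pairings, based on the usual meanings of the three words.
ANSWER_KEY = {
    "manual": "a well-structured collection of information for reference",
    "syntax": "the set of rules that defines the combinations of symbols",
    "default": "a value automatically assigned",
}


def score_matches(learner_matches, answer_key=ANSWER_KEY):
    """Return the words whose chosen definition agrees with the answer key."""
    return [
        word
        for word, definition in learner_matches.items()
        if answer_key.get(word) == definition
    ]


# A hypothetical learner attempt with "syntax" and "default" swapped.
attempt = {
    "manual": "a well-structured collection of information for reference",
    "syntax": "a value automatically assigned",
    "default": "the set of rules that defines the combinations of symbols",
}

print(score_matches(attempt))  # ['manual']
```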

Answering questions
Qn. Differentiate between syntax error and semantic error.
Ans. A syntax error is an error in the form of the code or statement. A semantic error basically means invalid logic.
Qn. What is the difference between a character array and an integer array?
Ans. A character array stores a sequence of characters, whereas an integer array stores a sequence of integers.
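The difference between the two kinds of error can be made concrete. The sketch below uses Python rather than C, purely for brevity; in C the compiler plays the role of rejecting the syntax error. All names (`has_syntax_error`, `average_wrong`, `average`) are hypothetical.

```python
def has_syntax_error(source):
    """Return True if Python cannot parse the source text at all."""
    try:
        compile(source, "<example>", "exec")
    except SyntaxError:
        return True
    return False


# Syntax error: the closing parenthesis is missing, so the code never runs.
broken = "total = (1 + 2"
fixed = "total = (1 + 2)"


def average_wrong(values):
    """Semantic (logic) error: parses and runs, but divides by the wrong number."""
    return sum(values) / 2  # bug: should divide by len(values)


def average(values):
    """Corrected logic: divide by the number of values."""
    return sum(values) / len(values)


print(has_syntax_error(broken), has_syntax_error(fixed))  # True False
print(average_wrong([2, 4, 6]), average([2, 4, 6]))       # 6.0 4.0
```

The syntax error is caught before execution, while the semantic error produces a wrong answer without any warning, which is why it is usually harder to find.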

Defining in the second language: (a) term (b) class (c) defining characteristics
(a) A character constant is (b) either a single alphabet, a single digit or a single special symbol (c) enclosed within single inverted commas.
(a) A variable in C is (b) a quantity which may vary (c) during program execution.
(a) Key words are (b) the words whose meaning has already been explained (c) to the C compiler.
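The term, class, and defining-characteristics pattern above can be treated as a sentence template. A minimal sketch, with a hypothetical `define` helper and an assumed `verb` parameter to cover plural terms:

```python
def define(term, word_class, characteristics, verb="is"):
    """Assemble a definition from (a) term, (b) class and (c) defining characteristics."""
    return f"{term} {verb} {word_class} {characteristics}."


print(define("A variable in C", "a quantity which may vary", "during program execution"))
print(define("Key words", "the words whose meaning has already been explained",
             "to the C compiler", verb="are"))
```

Keeping the three parts as separate arguments mirrors the classroom task: learners supply each part and see how they combine into one sentence.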

Conclusion The writing skills of the learners would be enhanced by the appropriate use of technical vocabulary. Teaching of vocabulary becomes meaningful, enhancing their academic writing. The learners would be able to produce better answers using the words from the corpus, which meets the goal from an examination point of view. Learner autonomy is enhanced. Learners become confident in their discourse with the professional community.