TaLC 2000 - Graz. The English Italian Translational Corpus: A resource for learning about translation. Federico Zanettin, Università di Bologna, SSLMIT – Forlì.


CEXI Project: Corpus English X / Cross / Translational Italian. Bilingual, parallel, bi-directional, translation-driven.

Types of comparison: Italian and English; TTs (target texts) and STs (source texts)

Aims: CEXI as a resource for learning about (language, culture, translation) and learning to (read, write, translate). Limitations: funds (corpus size: 4M words); copyright.

Design criteria: selection features; description features

Primary selection criteria: translations / translated texts; medium: books; time: contemporary; audience: adult; prose; country of publication

Secondary selection criteria: author; translator; publisher; price and availability

Descriptive features: size vs. variety; full texts vs. samples; domain: fiction vs. non-fiction

Fiction vs. non-fiction (Italy)

Corpus description: translation vs. non-translation; fiction vs. non-fiction; English vs. Italian

Translations E  I (Italy 76-95)I  E (USA 77-96) UDC categoryTexts% % Literature/Child. Lit %50228% Art/Games/Sports7576%34319% Edu/Law/So. Sci %18711% Applied Science183516%1388% History/Geo./Biog.9198%17110% Natural & Ex. Sci6436%1116% Philosophy/Psycol.8337%533% Generalities/Info. Sci1011%20% Religion/Theology4774%26715% Total % %

Translation components, non-fiction (UDC % from I.T.)

                  English (USA)    Italian (Italy)
Rel/Theo          21%              7%
Art/G/S           27%              11%
Edu/Law/SocSci    15%              19%
AppSci            11%              28%
His/Geo/Bio       13%              13%
Nat & ExSci       9%               9%
Phi/Psy           4%               12%
Gen/InfoSci       0%               1%
Total             100%             100%

No. of texts (provisional): English / Italian
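The English (USA) percentages on this slide appear to follow from the I → E text counts on the preceding Translations slide once the Literature category is left out. The Python sketch below recomputes them as a check; the dictionary and function names are illustrative and do not come from the slides, and the link between the two slides is an inference from the figures.

```python
# I -> E (USA 77-96) text counts per UDC category, taken from the
# "Translations" slide; Literature/Child. Lit. is excluded (non-fiction only).
i_to_e_counts = {
    "Rel/Theo": 267,
    "Art/G/S": 343,
    "Edu/Law/SocSci": 187,
    "AppSci": 138,
    "His/Geo/Bio": 171,
    "Nat & ExSci": 111,
    "Phi/Psy": 53,
    "Gen/InfoSci": 2,
}


def percentage_shares(counts):
    """Return each category's share of the total, rounded to a whole percent."""
    total = sum(counts.values())
    return {name: round(100 * n / total) for name, n in counts.items()}


non_fiction_shares = percentage_shares(i_to_e_counts)

for name, share in non_fiction_shares.items():
    print(f"{name:<16}{share:>3}%")
```

The rounded shares come out as 21, 27, 15, 11, 13, 9, 4 and 0 percent, which matches the English (USA) column.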

Non-translations

               Italian (titles)                          English (titles)
               Translations (E → I)   Book production    Translations (I → E)    Book production
Fiction        40%                    27% (Italy)        28% (USA), 31% (UK)     22% (USA), 24% (UK)
Non-fiction    60%                    73% (Italy)        72% (USA), 69% (UK)     78% (USA), 76% (UK)

Non-fictional, non-translational components in the corpus vs. total book production

                  Non-translation      Production    Non-translation      Production    Production
                  Italian component    (Italy)       English component    (USA)         (UK)
Rel/Theo          21%                  8%            7%                   7%            12%
Art/G/S           27%                  16%           11%                  9%            27%
Edu/Law/SocSci    15%                  28%           19%                  29%           20%
AppSci            11%                  16%           28%                  23%           15%
His/Geo/Bio       13%                  15%           13%                  13%           13%
Nat & ExSci       9%                   5%            9%                   8%            4%
Phi/Psy           4%                   8%            12%                  5%            3%
Gen/InfoSci       0%                   4%            1%                   6%            6%
Total             100%                 100%          100%                 100%          100%

Corpus composition (non-fiction)

                 No. of texts         Supp. texts          Total
                 English   Italian    English   Italian
Rel/Theo         8         3                    5          8
Art/G/S          11        4                    7          11
Edu/Law/SS       6         8          2                    8
AppSci           4         11         7                    11
His/Geo/Bio      5         5                               5
Nat & ExSci      4         4                               4
Phi/Psy          2         5          3                    5
Gen/InfoSci      0         0                               0
Total            40        40                              52
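The supplementary texts on this slide seem to top up whichever language has fewer texts in a category, so that both languages end with the same number per category. That reading is inferred from the figures and is not stated on the slide. A short Python sketch under that assumption (the function and variable names are hypothetical):

```python
# (English, Italian) text counts per non-fiction category, as read off the slide.
base_counts = {
    "Rel/Theo": (8, 3),
    "Art/G/S": (11, 4),
    "Edu/Law/SS": (6, 8),
    "AppSci": (4, 11),
    "His/Geo/Bio": (5, 5),
    "Nat & ExSci": (4, 4),
    "Phi/Psy": (2, 5),
    "Gen/InfoSci": (0, 0),
}


def balance(english, italian):
    """Assumed rule: raise the smaller count to the larger one.

    Returns (supplementary English, supplementary Italian, category total).
    """
    target = max(english, italian)
    return target - english, target - italian, target


balanced = {name: balance(en, it) for name, (en, it) in base_counts.items()}

supp_english = sum(row[0] for row in balanced.values())
supp_italian = sum(row[1] for row in balanced.values())
total_texts = sum(row[2] for row in balanced.values())

print(supp_english, supp_italian, total_texts)
```

Under this rule each language gains 12 supplementary texts, and the 40 base texts per language grow to the 52 shown in the Total row.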

Core corpus

Sub-category                            No. of text samples    No. of words per sample
Fiction                                 40                     12,500
Non-fiction                             52                     12,500
Total for one component                 92 texts               1,150,000 words
Total for core corpus (4 components)    368 texts              4,600,000 words
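The totals on this slide follow from multiplying the number of samples by the fixed sample length, then by the number of components. A minimal Python sketch of that arithmetic; the constant and variable names are illustrative and do not come from the slides.

```python
WORDS_PER_SAMPLE = 12_500   # fixed sample length given on the slide
COMPONENTS = 4              # components in the core corpus, per the slide

samples_per_component = {"fiction": 40, "non-fiction": 52}

texts_per_component = sum(samples_per_component.values())
words_per_component = texts_per_component * WORDS_PER_SAMPLE

core_texts = texts_per_component * COMPONENTS
core_words = words_per_component * COMPONENTS

print(f"one component: {texts_per_component} texts, {words_per_component:,} words")
print(f"core corpus:   {core_texts} texts, {core_words:,} words")
```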

Expansions: full texts; corrections to corpus composition; satellite corpora