Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Slides:



Advertisements
Similar presentations
A Semantic Web Approach to Digital Rights Management Roberto García González.
Advertisements

I-AIBS Institute for Artificial Intelligence and Biological Systems
School of something FACULTY OF OTHER School of Computing FACULTY OF ENGINEERING An open discussion and exchange of ideas Introduced by Eric Atwell, Language.
I-AIBS Institute for Artificial Intelligence and Biological Systems
Data Mining and Text Analytics By Saima Rahna & Anees Mohammad Quranic Arabic Corpus.
INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING NLP-AI IIIT-Hyderabad CIIL, Mysore ICON DECEMBER, 2003.
For Friday No reading Homework –Chapter 23, exercises 1, 13, 14, 19 –Not as bad as it sounds –Do them IN ORDER – do not read ahead here.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
NLP and Speech Course Review. Morphological Analyzer Lexicon Part-of-Speech (POS) Tagging Grammar Rules Parser thethe – determiner Det NP → Det.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
Higher English Listening Assessment. Internally Assessed Units Creation and Production ▫creating at least one written text using detailed and complex.
Information Reports.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
CHAPTER 1: Language in Our Lives
ELN – Natural Language Processing Giuseppe Attardi
CAREERS IN LINGUISTICS OUTSIDE OF ACADEMIA CAREERS IN INDUSTRY.
MODULE A Experience through language Elective II = Distinctively Visual.
English Language AS and A2. Which English to study?
Changes Among Us Rossie Brackett (Meteorologist) Ashley McDonald (Farmer) Donna Haon (Historian) Introduction Task Process Evaluation Conclusion Meteorologist.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Graphophonemic System – Phonics
Linguistics & AI1 Linguistics and Artificial Intelligence Linguistics and Artificial Intelligence Frank Van Eynde Center for Computational Linguistics.
NLP And The Semantic Web Dainis Kiusals COMS E6125 Spring 2010.
Template by Bill Arcuri, WCSD Click Once to Begin Religion JEOPARDY!
Linguistics The first week. Chapter 1 Introduction 1.1 Linguistics.
© 2005 Pearson Education, Inc. publishing as Longman Publishers. Chapter 6: Textbook Learning Breaking Through: College Reading, 7/e Brenda Smith.
From E-Content to E-Learning in Computational Linguistics Localisation of Teaching materials for less processed languages Kiril Simov *, Petya Osenova.
Markov Logic and Deep Networks Pedro Domingos Dept. of Computer Science & Eng. University of Washington.
1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
CSA2050 Introduction to Computational Linguistics Lecture 1 Overview.
Computational Linguistics. The Subject Computational Linguistics is a branch of linguistics that concerns with the statistical and rule-based natural.
CSA2050 Introduction to Computational Linguistics Lecture 1 What is Computational Linguistics?
Anthropology 340 LANGUAGE AND CULTURE Course Overview.
Introduction to Linguistics Class # 1. What is Linguistics? Linguistics is NOT: Linguistics is NOT:  learning to speak many languages  evaluating different.
Lecture 1 Lec. Maha Alwasidi. Branches of Linguistics There are two main branches: Theoretical linguistics and applied linguistics Theoretical linguistics.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Friday Finish chapter 24 No written homework.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
CSE467/567 Computational Linguistics Carl Alphonce Computer Science & Engineering University at Buffalo.
Metadata, Resources, and the RDF 김민수 Chapter 1. Creating the Sementic Web with RDF2 Overview Knowledge Representation Library Metadata RDFRDF.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
Annotation Procedure in Building the Prague Czech-English Dependency Treebank Marie Mikulová and Jan Štěpánek Institute of Formal and Applied Linguistics.
Syntactic Annotation of Slovene Corpora (SDT, JOS) Nina Ledinek ISJ ZRC SAZU
Latin root meaning “write”
Introduction A field survey of Dutch language resources has been carried out within the framework of a project launched by the Dutch Language Union (Nederlandse.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Constructing Your Response Social Studies Grades 6-8.
Basics of Natural Language Processing Introduction to Computational Linguistics.
INTRODUCTION TO APPLIED LINGUISTICS
TKT Tutoring Class Phonology.
Grammatical Issues in translation
Introduction to Artificial Intelligence Heshaam Faili University of Tehran.
Quran in Hindi ( हिन्दी कुरान ) Recite The Holy Qur’an Daily Now With this App Android Quran Application Solely designed for “ Indian Muslims ”
Mohammad Alqahtani, Dr. Eric Atwell
An Introduction to Linguistics
Al Quran Bhasa Indonesia
INTRODUCTION TO LINGUISTICS 1
ROOT WORDS Graph = Writing.
ROOT WORDS Graph = Writing.
Text Analytics Giuseppe Attardi Università di Pisa
Chapter 11 Oral Language: Listening and Speaking
Arabic Language Challenges
Introduction Task: extracting relational facts from text
word Vocabulary Cards Language Literature Social Studies Science Math
ROOT WORDS Graph = Writing.
Artificial Intelligence 2004 Speech & Natural Language Processing
 Understand Qur’an – Level-I Session No. 43
Owen Rambow 6 Minutes.
Presentation transcript:

Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari

Introduction  What is the Quran ?  Holy book for Muslims  Revealed from 610 AD  6,236 verses, 114 chapters  Corpus Definition.  Written or spoken language  What is the Quranic Arabic Corpus ?  77,430 words of Quranic Arabic  Researcher: Kais Dukes

Features of QAC:  Morphological Annotation  Syntactic Treebank  Semantic Ontology

Morphological Annotation  Word By Word  Grammar  Syntax  Morphology  Part-of-speech tagging  Natural Language Computing Technology

Details of Word’s Grammar  Clicking the word gives more detail:  Type of Word  Translation  Gender  Case  Root  In addition it shows the verse in which word appears and sound recitation of the verse.

Syntactic Treebank  Verse by verse dependency graphs  Meaning of verse (broken down)  Sentence structure (dependencies)  Case  Mathematical graph theory

Ontology of Concepts  Knowledge representation  Relationship between concepts  Historic places and people  Named entity tagging  E.g. Sun, Moon, Star, Earth classified under “Astronomical Body”  Uses predicate logic

Visual Representation of Ontology  300 linked concepts with 350 relations

Conclusion  Uses of the QAC:  Analysing Arabic text of each verse  Linking Arabic words through dependencies  Finding relationships between concepts  Website used daily by 2,500 people from 165 countries

Map Showing Usage of QAC

Bibliography 

Thank you for listening!