Oracle vs SQL Server Dr. Alex Wang. Oracle Text Oracle Text uses standard SQL to do almost everything. Full-text retrieval technology, deal with unstructured.

Slides:



Advertisements
Similar presentations
LIS618 lecture 6 Thomas Krichel structure DIALOG –basic vs additional index –initial database file selection (files) Lexis/Nexis.
Advertisements

How Come It Takes Me So Long to Get Answers to Simple Questions About My Business? Technologies for Business Intelligence Introduction to Microsoft Access.
Benchmarking Oracle 8i Intermedia Text Background for this benchmark Interesting new features in OIMT Benchmarking, methodology and problems Results Conclusions.
Chapter 5: Introduction to Information Retrieval
Introduction to Information Retrieval
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Data Mining (and Machine Learning) With Microsoft Tools Michael Lisin, Plaster Group May 8, 2014.
Text mining Extract from various presentations: Temis, URI-INIST-CNRS, Aster Data …
IS530 Lesson 12 Boolean vs. Statistical Retrieval Systems.
6/16/20151 Recent Results in Automatic Web Resource Discovery Soumen Chakrabartiv Presentation by Cui Tao.
DB2 Net Search Extender Presenter: Sudeshna Banerji (CIS 595: Bioinformatics)
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Introduction to Full-Text Searching in SQL Server 2012 Adolfo J. Socorro, Ph.D. IT Impact, Inc.
Oracle Text Operations J. Molka-Danielsen Sept. 30, 2002.
Lecture-8/ T. Nouf Almujally
Premier Accessibility Suite Software for Reading and Writing.
Welcome to Business Intelligence Demo. Oracle Partner  We are an Oracle Partner.  We are professionals in Oracle Products.  Our Expertise… Oracle Applications.
Microsoft Office System UK Developers Conference Radisson Edwardian, Heathrow 29 th & 30 th June 2005.
CERN – European Organization for Nuclear Research Administrative Support - Advanced Information Systems Introduction to Oracle interMedia-Text By Derek.
Text Search and Fuzzy Matching
Exploring Word Grauer and Barber1 Committed to Shaping the Next Generation of IT Experts. Chapter 1: What Will Word Processing Do For Me? Robert.
Rationale Aspiring Database Developers should be able to efficiently query and maintain databases. This module will help students learn the Structured.
Data Mining Douglas C. Atkins, OCP FASTER Staff: 1998 – 2010.
Web Search Created by Ejaj Ahamed. What is web?  The World Wide Web began in 1989 at the CERN Particle Physics Lab in Switzerland. The Web did not gain.
Indexing Knowledge Daniel Vasicek 2014 March 27 Introduction Basic topic is : All Human Knowledge Who Cares? Simple Examples.
Database Solutions for Storing and Retrieving XML Documents.
Oracle Application Express (Oracle APEX), formerly called HTML DB, is a Free rapid web application development tool for the Oracle database.
Life Sciences Integrated Demo Joyce Peng Senior Product Manager, Life Sciences Oracle Corporation
Searching Databases. What is in the Library? The Online Library has thousands of journal articles and electronic books available for your use. Also available.
Word Processing An introduction to Microsoft Word Lecture 16.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Dr. Susan Gauch When is a rock not a rock? Conceptual Approaches to Personalized Search and Recommendations Nov. 8, 2011 TResNet.
 Text Representation & Text Classification for Intelligent Information Retrieval Ning Yu School of Library and Information Science Indiana University.
SQL pepper. Why SQL File I/O is a great deal of code Optimal file organization and indexing is critical and a great deal of code and theory implementation.
Advanced searching with Oracle Text Indexing and searching in text and documents Author: Krasen Paskalev Certified Oracle DBA Semantec.
SQL pepper. Why SQL File I/O is a great deal of code Optimal file organization and indexing is critical and a great deal of code and theory implementation.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
…and postgis & full text search & fuzzy comparisons.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
Oracle Dependencies Analyzer ODA Over time, in large companies we see many Legacy systems that work with several Databases, this.
Data Mining By Dave Maung.
Chapter 10: The Data Tier We discuss back-end data storage for Web applications, relational data, and using the MySQL database server for back-end storage.
´Google-ized´ search in your business data Author: Krasen Paskalev Certified Oracle 8i/9i DBA Seniour Oracle Consultant Semantec GmbH Benzstr.
Keyword Searching Weighted Federated Search with Key Word in Context Date: 10/2/2008 Dan McCreary President Dan McCreary & Associates
1 Opinion Retrieval from Blogs Wei Zhang, Clement Yu, and Weiyi Meng (2007 CIKM)
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
With Doug Atkins Getting Data Out of FASTER: Tips for the New & Experienced.
Foundations of Business Intelligence: Databases and Information Management.
1 Information Retrieval LECTURE 1 : Introduction.
Information Retrieval
1 Data Mining: Text Mining. 2 Information Retrieval Techniques Index Terms (Attribute) Selection: Stop list Word stem Index terms weighting methods Terms.
Clusterpoint Margarita Sudņika ms RDBMS & NoSQL Databases & tables → Document stores Columns, rows → Schemaless documents Scales UP → Scales UP.
Data Mining Instructor: Dr C. C. Chan Presented by Uday Seelam.
XML Databases. XML Like HTML –Tags –Fixed vocabulary of tags and fixed structure –Tags indicate formatting, not semantics Strict HTML – XHTML –Always.
BIT 3193 MULTIMEDIA DATABASE CHAPTER 4 : QUERING MULTIMEDIA DATABASES.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Query Refinement and Relevance Feedback.
Web based Documentation Distribution Tools: MSAccess database (DSN) DreamWeaver Ultradev Microsoft Image Composer Clicking on the document will open an.
Using Find, Replace, Go To In Microsoft Word By Krysia Biville.
Analyzing Text with SQL Server 2014, R, AND Azure ML Dejan Sarka.
Superhero Power BI Peter Myers Bitwise Solutions.
CS520 Web Programming Full Text Search Chengyu Sun California State University, Los Angeles.
Slide 1 © 2016, Lera Technologies. All Rights Reserved. SAP BO vs SPLUNK vs OBIEE By Lera Technologies.
CS520 Web Programming Full Text Search
Oracle Advanced Analytics
CS122B: Projects in Databases and Web Applications Winter 2017
Introduction to Web programming
Introduction to Information Retrieval
Web Mining Research: A Survey
Presentation transcript:

Oracle vs SQL Server Dr. Alex Wang

Oracle Text Oracle Text uses standard SQL to do almost everything. Full-text retrieval technology, deal with unstructured data. Data source could be database table, flat files, web sites. Index, search, analyze text and documents. Searching: keyword searching, context query, pattern matching, thematic queries, HTML/XML section searching. Use relevance-ranking to improve search quality. Supported formats: PDF, MS Office, HTML, XML

Search Operators used in Oracle Context search Near - return a score based on the proximity of two or more terms. Pattern search Fuzzy - spelled similar. Soundex - sound alike. Stem - search for all terms with the same root. Use thesaurus Preferred Term - replace query term with prefered term define in a thesaurus. Related Term - Expand to all related term defined in a thesaurus. Synonym - Expand to all terms defined as synonyms. Narrow Term - Expand to all terms defined as the narrower/lower level terms. Broader Term - Expand to all terms defined as broader/higher level terms. Top Term -

Search Operators used in SQL Server CONTAINS can search for: A word near another word. The prefix of a word or phrase. Soundex Function (for search sound alike). A word inflectionally generated from another (for example, the word drive is the inflectional stem of drives, drove, driving, and driven). A word that is a synonym of another word using thesaurus (for example, the word metal can have synonyms such as aluminum and steel).

FeatureOracleMicrosoft Available inSE, EEEE Decision TreeYY Support Vector MachineYN Neural NetworkNY Naive BayesYY Adaptive Bayes NetworkYN K-meansYY Expectation MaximizationNY Orthogonal ClusteringYN Path clusterNY Minimal Descriptor LengthYN Time SeriesYY Association RulesYY Note: Minimal Descriptor Length, identifies the relative importance of an attribute in predicting a given outcome.

Oracle emphasize PL/SQL statement Simple Prediction Query Question: Select all customers who have a high propensity to attrite (> 80% chance) SQL Query: SELECT A.cust_name, A.contact_info FROM customers A WHERE PREDICTION_PROBABILITY(tree_model, ‘attrite’ USING A.*) > 0.8

An Example of Oracle Text Mining Building a DT Models CREATE TABLE dt_settings ( setting_name VARCHAR2(30), setting_value VARCHAR2(30)); BEGIN -- Populate settings table INSERT INTO dt_sample_settings VALUES (dbms_data_mining.algo_name, dbms_data_mining.algo_decision_tree); COMMIT; DBMS_DATA_MINING.CREATE_MODEL( model_name => 'sales_type_model', mining_function => dbms_data_mining.classification, data_table_name => 'sales_dataset', case_id_column_name => 'sales_id', target_column_name => 'sales_type', settings_table_name => 'dt_settings'); END;

An Example of SQL Server Text Mining A Tutorial for Text Classification using SQL Server 2005 Beta2 Data Mining Peter Pyungchul Kim SQL Business Intelligence Microsoft Corporation mmunity/_tutorials/688.aspx

Data Source 5000 postings from 5 news groups We know which posting belong to which group Flat text file Goal: create a model based on these data to classify each posting to its group Randomly chose 70% for training, 30% for testing.

SQL Server You can do it by click through SQL Server GUI tools. 1. SQL Mgmt Studio - Create database, import the data 2. Business Intelligence Development Studio – Build a dictionary, term vectors. 3. Build/Test data mining models

Compare Classification Results