Financial Data Management

Slides:



Advertisements
Similar presentations
Yi Wang CTO, Data Services Ivy Li Director, Equity Data Collection
Advertisements

TELLEFSEN AND COMPANY, L.L.C. Execution Management Systems and Order Management Systems – Evolution and Growth December 2010 Proprietary and Confidential.
Your partner. Your provider. DeveCore DeveCore Corporation 2012 This is a brief presentation about DTx, the SMS gateway from DeveCore. For more technical.
Service Oriented Architecture for Mobile Applications Swarupsingh Baran University of North Carolina Charlotte.
Chapter 1 Business Driven Technology
Ezra T. Ernst Chief Executive Officer Swets Information Services, Inc. The Long Tail and its’ application To Scholarly Information.
NetWORKS Strategy Manugistics NetWORKS Strategy 6.2.
AMG Database of Fund Flows and Holdings Calculating Fund Flows and Holdings.
By N.Gopinath AP/CSE. Why a Data Warehouse Application – Business Perspectives  There are several reasons why organizations consider Data Warehousing.
An Analysis of Security and Privacy Issues in Smart Grid Software Architectures on Clouds Dresden, 22/05/2014 Felipe de Sousa Silva Simmhan, Kumnhare,
Bloomberg Industry Research :Data Company. Content Introduction Bloomberg Overview and History Core Business: Professional Services Reuters Overview and.
Data Data is collection of facts and figures which are not in directly usable form. It is also termed as Input about an item, a person or a place. It.
PgP MIS 202 Access Overview 1 Microsoft Access Introduction to Relational Databases Powerful tool to collect and analyze business data, facilitates decision-
1 CONCERT 2004 Power to the Librarian Delivering Transparency in the Serials Market Doug McMillan Managing Director Bowker UK Ltd.
ITGS Case Study Theatre Booking System Ayushi Pradhan.
One Marketplace Access Exchange.
FINESTI IS A SUBSIDIARY OF THE LUXEMBOURG STOCK EXCHANGE FINESTI Collecting, managing, disseminating Access to the Luxembourg fund universe Gateway to.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Built on the Microsoft Azure Platform, Prudena Provides Users with the Ideas and Confidence to Make Sound Investment Decisions MICROSOFT AZURE APP BUILDER.
Fall Supervisor : Shams Naveed Zia What is “Advisr” ? Advisr is going to be a web-based application which will be targeted towards brokerage houses,
Object storage and object interoperability
Morningstar Direct 3.1 Enhancements to: Navigation and Usability Functionality Data Released on Friday, Feb. 2, 2007 Morningstar Direct Team.
Lecture On Introduction (DBMS) By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
Axis AI Solves Challenges of Complex Data Extraction and Document Classification through Advanced Natural Language Processing and Machine Learning MICROSOFT.
THE FUTURE IS HERE: APPLICATION- AWARE CACHING BY ASHOK ANAND.
System Software Laboratory Databases and the Grid by Paul Watson University of Newcastle Grid Computing: Making the Global Infrastructure a Reality June.
Role of Metadata in dissemination of census data Regional Seminar on dissemination and spatial analysis of census data, Nairobi, September, 2010.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
TECHNOLOGY IN ACTION. Chapter 11 Behind the Scenes: Databases and Information Systems.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Social Media Analytics Market to Global Analysis and Forecast by Type, Application and Vertical No of Pages: 150 Publishing Date: Jan 2017 Single.
MEMBERS Horizon – The Value of Risk Control
Technology in the Exchange Industry
Building a Data Warehouse
Thank you/Appreciate time Intro me- Manage channel last 2 years
Wallpaper only – on screen during welcome and chat
Introduction To DBMS.
Avenues International Inc.
Decision Support Systems
INFORMATION SYSTEM CATEGORIES
Title Arrow Cloud NOTE: Remember to update name on slide.
Lecture 9 - Business Information Systems: Electronic Business Systems
Operation Data Analysis Hints and Guidelines
INTRODUCTION TO BUILDING A PORTFOLIO
MIS2502: Data Analytics Advanced Analytics - Introduction
Global E-Business: How Businesses Use Information Systems
Free Cloud Management Portal for Microsoft Azure Empowers Enterprise Users to Govern Their Cloud Spending and Optimize Cloud Usage and Planning MICROSOFT.
EzyAccounting An Accounting Software An Accounting Software By: Delicate Software Solutions Dubai, Manage Your Business… Not Just Accounts.
Moving the Needle Conference 2017
PMC – Office Hours Topic: Campaign Management.
Numbers, places, decisions
Explore. Discover. Focus.
IFX Forum Overview September 28, 2015 © Copyright IFX Forum, Inc
Fulfilling omni-channel demand Introduction
Mapping Your Future®: Supporting Standards
Media365 Portal by Ctrl365 is Powered by Azure and Enables Easy and Seamless Dissemination of Video for Enhanced B2C and B2B Communication MICROSOFT AZURE.
Automating Profitable Growth™
Data Discovery Change Committee.
Automating Profitable Growth™
MUMT611: Music Information Acquisition, Preservation, and Retrieval
SDMX: an Overview Abdulla Gozalov UNSD.
Data Warehousing Concepts
Databases This topic looks at the basic concept of a database, the key features and benefits of a Database Management System (DBMS) and the basic theory.
Banking and the Federal Reserve
The Weather Company, an IBM Business
Retail Business challenge
Multivariate Analysis - Introduction
Accounting Discipline Overview
Presentation transcript:

Financial Data Management Yi Wang Morningstar, Inc. My topic today is about financial data management. First of all, I’d like to give you a quick introduction of Morningstar.

Morningstar is a leading global provider of independent investment research. We have operations in 26 countries.

Individual investors served worldwide Financial advisors 7.4 Mil 270,000 4300 400,000 Individual investors served worldwide Financial advisors Institutional clients Investment offerings Supporting 7.4 mil individual investors; 270K financial advisors, 43 hundred institutional clients Morningstar provides data on approximately 400,000 investment offerings, including stocks, mutual funds, and similar vehicles, along with real-time global market data on more than 5 million equities, indexes, futures, options, commodities, and precious metals, in addition to foreign exchange and Treasury markets. Our business runs on data

What is a time series? TP: With so many different types of financial data, my focus of the day is on time series What’s a time series? A time series is a sequence of data points, measured typically at successive times spaced at uniform time intervals 3) Time series has natural temporal ordering

Usage of a time series? TP: A stock price chart is a typical example of where time series is being used. Audience with statistic and math background may start to think of the stationary, serial correlation, moving averages …. <next slide> and other modeling techniques .

TP: … and other modeling techniques in order to extract meaningful characteristics and to forecast what the future may be like.

Morningstar time series database Coverage 1984: First print product; 400 mutual funds 1991: First electronic product; 2,300 mutual funds; 10,000 time series Now: Multiple line of products; 143,000 mutual funds; over 100 million time series Variety of data Intra-day price Equity fundamentals Fees, expenses Economic series Weather … Here is brief overview of what we have in Morningstar time series database At the time we published our first product in 1984 Intra-day price Equity fundamentals Fees, expenses Economic series Weather …

Challenges & Solutions Having such vast amount of time series data in our database enables us to provide powerful research capabilities, but in the mean time poses challenges in various areas, and I’d like to share with you here the major challenges we encountered and solutions we’ve explored

Collection and processing Challenges: Multiple data source Identification Consistency Labor intensive efforts Solutions: Intelligent data consolidation Dependency awareness Starting from data collection and processing, the first challenge we encounter is we got data from multiple data sources, so Which is the right copy and How to aggregate is the first question we need to answer Since investment identifiers we get can change from time to time, and varies by providers, we also need to figure out is how to link information for the same time series together to our permanent identifier As most of the time series in our system are derieved from one or multiple raw data series, how to keep all the related time series consistent when there are corrections to the raw data is another area we need to look into. Obviously, with the massive nature of time series data, collecting and maintain a quality database requires a large amount of work We focus in two areas for our solution: building a intellegent data consolicataion collection system, and create dependency consicious processing mechanism

Storage and dissemination Challenges: Latency and throughput requirement Deliver data to meet different demands (with low latency) Solution: System designed for time series When the time comes to store and deliver data, we have to deal with the latency and throughput requirement and how to deliver data to meet different demands, because same data need to be delivered in different format and delivery mechanism Size: Accessing speed Network bandwidth Delivery challenge Granularity Format Delivery metho We researched a lot of existing solutions but none seemed to be meeting our need and ended up developing an inhouse solution.

Globalization Challenges: Regulatory Standardization Data representation Solution: Market and culture sensitive How did we do it when expanding our data coverage from US based to Global? How do we support localization with a global perspective? The first area we need to look into is regulatory, for example, when European countries started to adapt Euro, how to handled the different adpation time of euro for each country, how to support pre euro currencies that no longer in existence Aside from regulatory concerns, standization is another area that requires a lot of consideration. Aside from what we commonly know about standardiization, which is finding the commonality among information that presented in different ways in different location, we also need to be aware of what not to standardize, because standardining information that appears to be same will compromise data quality and interpretation. Lastly Data presentation: unit of data, different format of same data when presented in different country are examples we have to take into account TP: With offices & service in 26 country, how to we during to localization with a global perspective? European currency change Time zone Expectation on Data quality

A closer look In the next couple of slides, I’ll show you how we put all the afore-mentioned solutions together in our system using market price as example

For when the price reaches us through exchanges till it gets to products like this

Data Consolidation Mapping rule User interface Quality rule Morningstar products Data Source Quality rule Clean up rule Merge rule User interface Machine learning Mapping rule Data Consolidation NASDAQ Time series system Data vendor Calculation When price for a company, say Apple, is delivered to us through exchange, we… Next, let’s take a closer look at how our proprietary system works once time series data is stored in it.

Time series engine Data Interface Data filter Data assembler Data formatter Query Engine Market adapter Time aggregator Localization adapter At the bottom level, we have the storage department that archives and indexes the data values as well as meta data. It was built on a distribution storage scheme on cloud platform to optimize data compression. In the middle is where most of our intelligence… that handle all the custom query and optimization, as well as business rule On top is where all the formalization and transformation happens before the data is presented to the user. Time series is definitely an interesting and unique data that is grant itself an independent place for it to be discussed. I’m sure many of you may have a different take on it than us, and it would be great if we can discuss further  Storage Metadata Time series content