Object Recognition & Detection

Slides:



Advertisements
Similar presentations
Academic Advisor: Dr. Yuval Elovici Technical Advisor: Dr. Rami Puzis Team Members: Yakir Dahan Royi Freifeld Vitali Sepetnitsky 2.
Advertisements

GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
© 2012 Cisco and/or its affiliates. All rights reserved. Presentation_ID Cisco Public Quad APIs and SDK Preview Sachin Smotra Product Manger, Enterprise.
 Image Search Engine Results now  Focus on GIS image registration  The Technique and its advantages  Internal working  Sample Results  Applicable.
AceMedia Personal content management in a mobile environment Jonathan Teh Motorola Labs.
LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
Making You Explore the Potential of Online Business CMS Based - Web Development Solutions.
Multimedia Databases (MMDB)
 The most intelligent device - “Human Brain”.  The machine that revolutionized the whole world – “computer”.  Inefficiencies of the computer has lead.
 CIKM  Implementation of Smoothing techniques on the GPU  Re running experiments using the wt2g collection  The Future.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Image Retrieval and Ranking using L.S.I and Cross View Learning Sumit Kumar Vivek Gupta
Sparse Coding: A Deep Learning using Unlabeled Data for High - Level Representation Dr.G.M.Nasira R. Vidya R. P. Jaia Priyankka.
Automatic License Plate Recognition for Electronic Payment system Chiu Wing Cheung d.
9/24/2017 7:27 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
WHY VIDEO SURVELLIANCE
WHY VIDEO SURVELLIANCE
Connected Infrastructure
2/13/2018 4:38 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Automatic Grading of Diabetic Retinopathy through Deep Learning
All about Ashley GmbH COMMUNICATION PARTNERS Partner overview.
4/19/ :02 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
The Relationship between Deep Learning and Brain Function
ITCS 6157/8157: Visual Database
What’s new with Power BI /guyinacube.
CSE Jeongbin Choe Advisor: Prof. Bohyung Han (CV Lab)
Let’s talk Power BI Premium /guyinacube Adam Saxton.
Utilizing AI & GPUs to Build Cloud-based Real-Time Video Event Detection Solutions Zvika Ashani CTO.
Deep Learning for Natural Language Processing in R
Connected Infrastructure
Deep Learning Libraries
Changing how people interact with computers
Implementing Boosting and Convolutional Neural Networks For Particle Identification (PID) Khalid Teli .
Introductory Seminar on Research: Fall 2017
Ajita Rattani and Reza Derakhshani,
Handling Data Using Databases
Pearson Lanka (Pvt) Ltd.
New horizons in the artificial vision
Interactive Learning An empFinesseTM Smart Atomic Learning Solution.
Bird-species Recognition Using Convolutional Neural Network
Artificial Intelligence Changes the Security Landscape
A Comparative Study of Convolutional Neural Network Models with Rosenblatt’s Brain Model Abu Kamruzzaman, Atik Khatri , Milind Ikke, Damiano Mastrandrea,
Image Processing Platform
Dog/Cat Classifier Christina Stiff.
Introduction to Deep Learning with Keras
Alain Goossens & Jean-Pierre Van Loo Data scientists – SII Belgium
Oral presentation for ACM International Conference on Multimedia, 2014
Social media for global scientific community – Mendeley project
Declarative Transfer Learning from Deep CNNs at Scale
Technical Capabilities
AGMLAB Information Technologies
WHY VIDEO SURVELLIANCE
John H.L. Hansen & Taufiq Al Babba Hasan
RCNN, Fast-RCNN, Faster-RCNN
CS246: Information Retrieval
DATS International Portfolio.
WHY VIDEO SURVELLIANCE
Neural Network Pipeline CONTACT & ACKNOWLEDGEMENTS
Research Institute for Future Media Computing
Airport Parking Space Navigation
Heterogeneous convolutional neural networks for visual recognition
Automatic Handwriting Generation
Human-object interaction
Image Processing and Multi-domain Translation
Object Detection Implementations
What's New in eCognition 9
CRCV REU 2019 Kara Schatz.
Report 2 Brandon Silva.
Huawei CBG AI Challenges
AI Builder for Power Platform
Presentation transcript:

Object Recognition & Detection An empFinesseTM Fundamentals Solution

Regions of Interest (ROI) Key Terms Deep Learning Object Recognition Convolutional Neural Network (CNN) Object Detection Custom Vision API Regions of Interest (ROI) Object Classification MS CNTK

Detect like a Human Eye? 04 01 02 03 Locate objects? Recognize object /text in an image / video? Memorize and Train? Recollect / Remember?

Sample Use Cases Detect components of a motherboard of any device Recognize processor type Recognize missing parts Recognize individual employees in an office/ODC Detect in/out of resources just like an human eye Detect Non-employee vehicles in a parking lot Recognize a vehicle without any external tags Trace a car by tracing its entry/exit in each toll Identify defective stage of a product Early detection of defects

Object Detection Approach

About Solution ObjecTell Process an Image Extract ROIs Classify Regions Faster CNN flow http://hcltonazure.cloudapp.net/imagerecognition/Detect.aspx

Technology Perspectives MongoDB Detection UI Search UI Admin UI CNTK Python scripts ObjectTell WCF Service Azure Vision API Connector DAL Connector Objects Recognizer Search Component Dataset Manager Metadata Manager User Query Manager Resource Data Retriever Azure Computer Vision API Service Azure Custom Vision API Service ObjectTell Dataset Train UI Object Marker Usage Dashboard Character Recognizer NLP Core Video Parser Usage Analytics UI Layer Service Layer Data Access Layer Yet to be implemented

Current & Future State Products / Libraries Used Maturity Level MS CNTK 2.3 Library Anaconda3 4.1.1 Python 3.5 MS Custom Vision API Azure Computer Vision API for OCR OpenALPR API Stanford NLP core (yet to integrate) Maturity Level First stage: Training / Learning phase (currently got a decent model for cars dataset, grocery dataset) Second stage: Objects Recognition and Detection– Azure hosted web application is available now. We can improvise this solution to embed within a mobile as an mobile app and user can snap a picture directly through this app and recognize the objects in that picture. Third stage: Self-learning phase - We have to enhance our solution to self learn based on user corrections/suggestions. Future State Recognize other domain / industry related objects Parse video into individual frames and recognize objects / text Support English like query through NLP Embed this feature within a camera Generate Report on number of detections and accuracy

Business Model

38.92 B$ business by 2021 (image Processing) Business Relevance 38.92 B$ business by 2021 (image Processing)

Scalability # Scalability Need Current State Resource Needed 1 Process more images (10,000s) in few hours It takes around 6-9 hours to process 100+ images Deep Learning VM / GPU optimized VMs 2 Improve object Recognition and Detection accuracy Recognizes fully visible objects. If they overlap then accuracy and probability of detecting more objects is 50% 3 Cater to More domains Trained for Grocery and vehicles dataset, it can be extended to other domains as well Need VMs/servers with more hard disk space (>500 GB) 4 Cater to more users Have tested with 5 users – more number of parallel users have impact on detection time Deep Learning VM / GPU optimized VMs and design level change to create different threads for multiple parallel users 5 Cater to multiple customers at same time It can cater to 1 customer users per Azure site instance More HDD space and optimized VMs

Snapshots

Thank You