Object Recognition & Detection

Slides:

Advertisements

Similar presentations

Academic Advisor: Dr. Yuval Elovici Technical Advisor: Dr. Rami Puzis Team Members: Yakir Dahan Royi Freifeld Vitali Sepetnitsky 2.

Advertisements

GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.

© 2012 Cisco and/or its affiliates. All rights reserved. Presentation_ID Cisco Public Quad APIs and SDK Preview Sachin Smotra Product Manger, Enterprise.

 Image Search Engine Results now  Focus on GIS image registration  The Technique and its advantages  Internal working  Sample Results  Applicable.

AceMedia Personal content management in a mobile environment Jonathan Teh Motorola Labs.

LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.

Making You Explore the Potential of Online Business CMS Based - Web Development Solutions.

Multimedia Databases (MMDB)

 The most intelligent device - “Human Brain”.  The machine that revolutionized the whole world – “computer”.  Inefficiencies of the computer has lead.

 CIKM  Implementation of Smoothing techniques on the GPU  Re running experiments using the wt2g collection  The Future.

September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.

Image Retrieval and Ranking using L.S.I and Cross View Learning Sumit Kumar Vivek Gupta

Sparse Coding: A Deep Learning using Unlabeled Data for High - Level Representation Dr.G.M.Nasira R. Vidya R. P. Jaia Priyankka.

Automatic License Plate Recognition for Electronic Payment system Chiu Wing Cheung d.

9/24/2017 7:27 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.

WHY VIDEO SURVELLIANCE

WHY VIDEO SURVELLIANCE

Connected Infrastructure

2/13/2018 4:38 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.

Automatic Grading of Diabetic Retinopathy through Deep Learning

All about Ashley GmbH COMMUNICATION PARTNERS Partner overview.

4/19/ :02 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.

The Relationship between Deep Learning and Brain Function

ITCS 6157/8157: Visual Database

What’s new with Power BI /guyinacube.

CSE Jeongbin Choe Advisor: Prof. Bohyung Han (CV Lab)

Let’s talk Power BI Premium /guyinacube Adam Saxton.

Utilizing AI & GPUs to Build Cloud-based Real-Time Video Event Detection Solutions Zvika Ashani CTO.

Deep Learning for Natural Language Processing in R

Connected Infrastructure

Deep Learning Libraries

Changing how people interact with computers

Implementing Boosting and Convolutional Neural Networks For Particle Identification (PID) Khalid Teli .

Introductory Seminar on Research: Fall 2017

Ajita Rattani and Reza Derakhshani,

Handling Data Using Databases

Pearson Lanka (Pvt) Ltd.

New horizons in the artificial vision

Interactive Learning An empFinesseTM Smart Atomic Learning Solution.

Bird-species Recognition Using Convolutional Neural Network

Artificial Intelligence Changes the Security Landscape

A Comparative Study of Convolutional Neural Network Models with Rosenblatt’s Brain Model Abu Kamruzzaman, Atik Khatri , Milind Ikke, Damiano Mastrandrea,

Image Processing Platform

Dog/Cat Classifier Christina Stiff.

Introduction to Deep Learning with Keras

Alain Goossens & Jean-Pierre Van Loo Data scientists – SII Belgium

Oral presentation for ACM International Conference on Multimedia, 2014

Social media for global scientific community – Mendeley project

Declarative Transfer Learning from Deep CNNs at Scale

Technical Capabilities

AGMLAB Information Technologies

WHY VIDEO SURVELLIANCE

John H.L. Hansen & Taufiq Al Babba Hasan

RCNN, Fast-RCNN, Faster-RCNN

CS246: Information Retrieval

DATS International Portfolio.

WHY VIDEO SURVELLIANCE

Neural Network Pipeline CONTACT & ACKNOWLEDGEMENTS

Research Institute for Future Media Computing

Airport Parking Space Navigation

Heterogeneous convolutional neural networks for visual recognition

Automatic Handwriting Generation

Human-object interaction

Image Processing and Multi-domain Translation

Object Detection Implementations

What's New in eCognition 9

CRCV REU 2019 Kara Schatz.

Report 2 Brandon Silva.

Huawei CBG AI Challenges

AI Builder for Power Platform

Presentation transcript:

Object Recognition & Detection An empFinesseTM Fundamentals Solution

Regions of Interest (ROI) Key Terms Deep Learning Object Recognition Convolutional Neural Network (CNN) Object Detection Custom Vision API Regions of Interest (ROI) Object Classification MS CNTK

Detect like a Human Eye? 04 01 02 03 Locate objects? Recognize object /text in an image / video? Memorize and Train? Recollect / Remember?

Sample Use Cases Detect components of a motherboard of any device Recognize processor type Recognize missing parts Recognize individual employees in an office/ODC Detect in/out of resources just like an human eye Detect Non-employee vehicles in a parking lot Recognize a vehicle without any external tags Trace a car by tracing its entry/exit in each toll Identify defective stage of a product Early detection of defects

Object Detection Approach

About Solution ObjecTell Process an Image Extract ROIs Classify Regions Faster CNN flow http://hcltonazure.cloudapp.net/imagerecognition/Detect.aspx

Technology Perspectives MongoDB Detection UI Search UI Admin UI CNTK Python scripts ObjectTell WCF Service Azure Vision API Connector DAL Connector Objects Recognizer Search Component Dataset Manager Metadata Manager User Query Manager Resource Data Retriever Azure Computer Vision API Service Azure Custom Vision API Service ObjectTell Dataset Train UI Object Marker Usage Dashboard Character Recognizer NLP Core Video Parser Usage Analytics UI Layer Service Layer Data Access Layer Yet to be implemented

Current & Future State Products / Libraries Used Maturity Level MS CNTK 2.3 Library Anaconda3 4.1.1 Python 3.5 MS Custom Vision API Azure Computer Vision API for OCR OpenALPR API Stanford NLP core (yet to integrate) Maturity Level First stage: Training / Learning phase (currently got a decent model for cars dataset, grocery dataset) Second stage: Objects Recognition and Detection– Azure hosted web application is available now. We can improvise this solution to embed within a mobile as an mobile app and user can snap a picture directly through this app and recognize the objects in that picture. Third stage: Self-learning phase - We have to enhance our solution to self learn based on user corrections/suggestions. Future State Recognize other domain / industry related objects Parse video into individual frames and recognize objects / text Support English like query through NLP Embed this feature within a camera Generate Report on number of detections and accuracy

Business Model

38.92 B$ business by 2021 (image Processing) Business Relevance 38.92 B$ business by 2021 (image Processing)

Scalability # Scalability Need Current State Resource Needed 1 Process more images (10,000s) in few hours It takes around 6-9 hours to process 100+ images Deep Learning VM / GPU optimized VMs 2 Improve object Recognition and Detection accuracy Recognizes fully visible objects. If they overlap then accuracy and probability of detecting more objects is 50% 3 Cater to More domains Trained for Grocery and vehicles dataset, it can be extended to other domains as well Need VMs/servers with more hard disk space (>500 GB) 4 Cater to more users Have tested with 5 users – more number of parallel users have impact on detection time Deep Learning VM / GPU optimized VMs and design level change to create different threads for multiple parallel users 5 Cater to multiple customers at same time It can cater to 1 customer users per Azure site instance More HDD space and optimized VMs

Snapshots

Thank You