Martin Croome VP Business Development GreenWaves Technologies.

Slides:



Advertisements
Similar presentations
DSPs Vs General Purpose Microprocessors
Advertisements

TIE Extensions for Cryptographic Acceleration Charles-Henri Gros Alan Keefer Ankur Singla.
A reconfigurable system featuring dynamically extensible embedded microprocessor, FPGA, and customizable I/O Borgatti, M. Lertora, F. Foret, B. Cali, L.
Lowest power 1.8 V audio codec with integrated miniDSP in production today Luca Cacioli Audio Product Marketing Embargoed Until Oct. 27, 2008.
Introduction to ARM Architecture, Programmer’s Model and Assembler Embedded Systems Programming.
Embedded Computing From Theory to Practice November 2008 USTC Suzhou.
1 Design and Implementation of Turbo Decoders for Software Defined Radio Yuan Lin 1, Scott Mahlke 1, Trevor Mudge 1, Chaitali.
ATMEL AVR. Facts ● ATMEL formed in 1984 ● AVR line introduced in 1993 ● RISC Architecture ● 90 instructions most of which can be performed in 1 clock.
1 Computer Science, University of Warwick Architecture Classifications A taxonomy of parallel architectures: in 1972, Flynn categorised HPC architectures.
Uncovering the Multicore Processor Bottlenecks Server Design Summit Shay Gal-On Director of Technology, EEMBC.
ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.
March 9, 2015 San Jose Compute Engineering Workshop.
A few issues on the design of future multicores André Seznec IRISA/INRIA.
Hardware Benchmark Results for An Ultra-High Performance Architecture for Embedded Defense Signal and Image Processing Applications September 29, 2004.
1)Leverage raw computational power of GPU  Magnitude performance gains possible.
DSP Architectures Additional Slides Professor S. Srinivasan Electrical Engineering Department I.I.T.-Madras, Chennai –
ULTRALEV: Ultra-Low-Energy Video Sensor Networks for IoT ($700B market) Technology Research Center University of Turku Finland.
Fall 2012 Parallel Computer Architecture Lecture 4: Multi-Core Processors Prof. Onur Mutlu Carnegie Mellon University 9/14/2012.
Chapter 1 Coding Introduction.
READ ME FIRST Use this template to create your Partner datasheet for Azure Stack Foundation. The intent is that this document can be saved to PDF and provided.
Parallel Programming Models
Programmable Logic Devices
i.MX Processor Roadmap i.MX 8 family i.MX 8M family i.MX 8X family
Gopakumar.G Hardware Design Group
Lightweight Cryptography for IoT
18-447: Computer Architecture Lecture 30B: Multiprocessors
Hardware design considerations of implementing neural-networks algorithms Presenter: Nir Hasidim.
Benchmarking Deep Learning Inference
Dynamo: A Runtime Codesign Environment
ECE354 Embedded Systems Introduction C Andras Moritz.
Microarchitecture.
Designing with ARM Cortex-M4 (and a Real Dev Board)
Low-power Digital Signal Processing for Mobile Phone chipsets
Ph.D. in Computer Science
PRESENTATION ON ARM PROCESSORS
SECTIONS 1-7 By Astha Chawla
Hot Processors Of Today
Embedded Systems Design
Texas Instruments TDA2x and Vision SDK
Introduction.
Intel’s Core i7 Processor
Department of Electrical & Computer Engineering
Ultra-Low-Voltage UWB Baseband Processor
Why microcontrollers in embedded systems?
Ray-Cast Rendering in VTK-m
Comp 541 Wrap Up! Montek Singh Apr 27, 2018.
New horizons in the artificial vision
ریز پردازنده. ریز پردازنده مراجع درس میکروکنترلرهای AVR برنامه نویسی اسمبلی و C محمدعلی مزیدی، سپهر نعیمی و سرمد نعیمی مرجع کامل میکروکنترلرهای AVR.
Digital Processing Platform
CS294-1 Reading Aug 28, 2003 Jaein Jeong
Chapter 1 Introduction.
Objective Understand the concepts of modern operating systems by investigating the most popular operating system in the current and future market Provide.
Compiler Back End Panel
Detailed Analysis of MiBench benchmark suite
STUDY AND IMPLEMENTATION
Compiler Back End Panel
What's New in eCognition 9
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
A High Performance SoC: PkunityTM
3D applications in Delphi
Chapter 1 Introduction.
AI Stick Easy to learn and use, accelerate the industrialization of artificial intelligence, and let the public become an expert in AI.
Final Project presentation
Implementation of a GNSS Space Receiver on a Zynq
Global megatrends (relevant for our business)
Objective Understand the concepts of modern operating systems by investigating the most popular operating system in the current and future market Provide.
CSE 502: Computer Architecture
ADSP 21065L.
What's New in eCognition 9
What's New in eCognition 9
Presentation transcript:

Martin Croome VP Business Development GreenWaves Technologies

WHY? AI IS MOVING TO THE EDGE From the cloud To the edge of IoT CLOUD CONGESTION PRIVACY DECISION LATENCY POWER Company Proprietary

But what happens WHEN You GET TO THE VERY EDGE? at the very Edge WEARABLE SMALL EASILY INSTALLABLE DISTANT FROM POWER DIFFICULT TO SERVICE PROBLEMS TO SOLVE Wide range of compute STATES Wide range of compute TASKS Scale consumption to workload Absolute consumption in state Speed of transitions between states Acquisition, pre-processing, inference, (communication) Rapidly changing algorithms

GAP8 enables at the very Edge A highly integrated MCU combining a 8 core parallel compute cluster and a single core controller ~20x better power efficiency than the state-of-the-art on content understanding applications at < 100mW High compute at ultra-low-power 01 Ultra-fast dynamic, power state transitions Wake up in 0.5ms 1uA standby current Agile 02 Fully programmable Wide range of accelerated algorithms Flexible 03 Company Proprietary

An example of GAP8’s Energy Efficiency TARGET CLOCK TIME CYCLES ACTIVE POWER STM32 F7 216Mhz 99.1ms 21 400 000 60mW GAP8 * 15.4Mhz 1 500 000 3.7mW 175Mhz 8.7ms 70mW GAP8 ** 4.7Mhz 460 000 0.8mW STM 32 H7 216Mhz 40nm 16 x reduction 11 x 75 x Comparison of Latest optimized ARM CMSIS-NN library versus GAP8 implementation of identical CNN graph trained on CIFAR-10 images. Source*: ARM processors blog Running on GAP8 cluster. * No Hardware Convolution Engine ** With Hardware Convolution Engine Company Proprietary

How does GAP8 achieve Energy Efficiency? Uniform Extended Instruction Set (ISA) DSP Extensions SIMD instructions Bit manipulation 01 Programmable in C/C++ Efficient parallelization Hardware thread dispatch and synchronization integrated with automatic clock gating 02 GCC based toolchain Shared instruction cache Reducing power overhead of instruction fetch 03 Visual IDE based on Platform IO Code generation tool (AutoTiler) smooths parallel code generation HW Convolution Engine Single cycle 5 x 5 16 bit fixed point convolution 04 Pre-built generators for a wide range of algorithms Ultra fast HW state changes All power management on SoC 05 Company Proprietary

A wide range of use cases are enabled by GAP8 “6 cars at traffic light” Smart city “10 desks free in office” Smart spaces “bearing failing” Industrial “hi Martin” Consumer PEOPLE / OBJECT COUNTING VIBRATION ANALYSIS ROBOT CONTROL AND NAVIGATION SOUND ANALYSIS FAR FIELD KEYWORD SPOTTING “broken window by stone” Smart building “take off” Consumer “not a human” Security “wake up! something is happening” Government OBJECT RECOGNITION HIERARCHICAL ACTIVATION

Open source origins Best in class Instruction Set Architecture (ISA) UC Berkeley originated Open Source Computing Platform created by ETHZ and UniBo Engineered as Ultra-low power IoT Application Processor GreenWaves staff are on the RISC-V technical and marketing committees GreenWaves is a key contributor to PULP LEVERAGING COMMUNITIES UNIQUE CAPITAL EFFICIENCY Company Proprietary

GAP8 is Available Now GAPuino GAP8 - buy on the websites: 01 01 https://greenwaves-technologies.com/store/ https://www.seeedstudio.com/GAPUINO-GAP8-Developer-Kit-1st-fully-programmable-multi-core-RISC-V-Processor-for-IoT-Application-p-3090.html Full open source SDK - get on GitHub https://github.com/GreenWaves-Technologies/gap_sdk 02 GAP8 engineering samples - buy on website: 03 https://greenwaves-technologies.com/store/ GAP8 production qualified - Q2 2019 04

Thank you! Questions?