Mnih, Volodymyr, et al. "Human-level control through deep reinforcement learning." Nature 518.7540 (2015): 529-533. Experiments by: Ashish Budhiraja Course:

Slides:



Advertisements
Similar presentations
Courses: Dr. Gareth Rice (2012) Concepts, Theories and Urban Studies Tue E206, Physicum Lectures, videos and focussed class.
Advertisements

Reinforcement Learning & Apprenticeship Learning Chenyi Chen.
Deep Learning.
Visualization CSC 485A, CSC 586A, SENG 480A Instructor: Melanie Tory.
Elluminate Features CITE Office. Interactive Features Two-way audio Two-way audio One-way video One-way video  May be switched among participants Interactive.
Bieber et al., NJIT © Using PLA to Liberate Learning (PLA: participatory learning approach) Michael Bieber, Jia Shen, Dezhi Wu, Vikas Achhpiliya.
Computer Game Programming Rick Barker Cecil Schmidt Carol Browning Ernest Ferguson.
1. Introduction to Pattern Recognition and Machine Learning. Prof. A.L. Yuille. Dept. Statistics. UCLA. Stat 231. Fall 2004.
To High School and Beyond: Career Explorations in the Middle School Classroom Alonia M. Johnson, M.Ed., M.S. Technology Instructor.
Enhancing Learning and Creating Community Using Wimba Classroom Hannah German The Lookstein Center, Bar-Ilan University
Multi-task Low-rank Affinity Pursuit for Image Segmentation Bin Cheng, Guangcan Liu, Jingdong Wang, Zhongyang Huang, Shuicheng Yan (ICCV’ 2011) Presented.
Course overview Course title: Discrete mathematics for Computer Science Instructors: Dr. Abdelouahid Derhab Credit.
computer
Understanding the Emotional Impact of Images Jia JIA Computer Science, Tsinghua University Joint work with Xiaohui WANG, Peiyun HU, Sen WU, Jie TANG and.
Andrew Ng Feature learning for image classification Kai Yu and Andrew Ng.
12/7/10 Looking Back, Moving Forward Computational Photography Derek Hoiem, University of Illinois Photo Credit Lee Cullivan.
Arcade Games 1970~1980 Atari produced was known as Pong, an arcade- friendly version of Hinginbotham’s Tennis for Two. Pong was the first big commercially.
Monitoring and Enhancing Visual Features (movement, color) as a Method for Predicting Brain Activity Level -in Terms of the Perception of Pain Sensation.
Course Overview  What is AI?  What are the Major Challenges?  What are the Main Techniques?  Where are we failing, and why?  Step back and look at.
Reinforcement learning (Chapter 21)
Computer Vision Group Prof. Daniel Cremers Autonomous Navigation for Flying Robots Lecture 1.1: Welcome Jürgen Sturm Technische Universität München.
PENGENALAN POLA DAN VISI KOMPUTER PENDAHULUAN. Vision Vision is the process of discovering what is present in the world and where it is by looking.
Nutrition Course 360 NCAT Conference March Nutrition Course 360 Features Practice test Personalized Learning Plans Study Tools Imbedded animation.
Population Dynamics Using Multi Agent Modeling Annie Martin Computer Systems Lab Period 3,
Psychology 562 Advanced Human Factors 1 Week 1-1: Introduction.
Deep Learning and Deep Reinforcement Learning. Topics 1.Deep learning with convolutional neural networks 2.Learning to play Atari video games with Deep.
Conditional Generative Adversarial Networks
Deep Reinforcement Learning
Graph Representations
Deep Reinforcement Learning
POSITION-CORRECTING TOOLS FOR 2D DIGITAL FABRICATION
From Vision to Grasping: Adapting Visual Networks
Article Review Todd Hricik.
Perceptual Loss Deep Feature Interpolation for Image Content Changes
Deep reinforcement learning
How it Works: Convolutional Neural Networks
نتعارف لنتألف في التعارف تألف (( الأرواح جنود مجندة , ماتعارف منها أئتلف , وماتنافر منها اختلف )) نماذج من العبارات الايجابية.
Reinforcement learning with unsupervised auxiliary tasks
Here is the graph of a function
Human-level control through deep reinforcement learning
Contemporary Texas Project
Putt-Putt Golf (Miniature Golf) Course Project
Jia-Bin Huang Virginia Tech ECE 6554 Advanced Computer Vision
Physics-based simulation for visual computing applications
ECE 599/692 – Deep Learning Lecture 1 - Introduction
Martin Granger-Piché Emric Epstein Pierre Poulin
Tagging Review Comments Rationale #10 Week 13
Adventures in Computational Thinking By Chin Hao Chang
A graphing calculator is required for some problems or parts of problems 2000.
An Introduction to Computer Vision& Pattern Recognition Group
Graphic Sources By Ms. Martin.
Convolutional Network by GoogLeNet
Anarghya Mitra and Zelun Luo
Textual Video Prediction
Matthew J. Kolek et al. JACEP 2017;j.jacep
Wrap-up Computer Vision Spring 2019, Lecture 26
ENES 182 Engineering the Future
Jia-Bin Huang Virginia Tech
Project Design and Framing
Online verification using reachable occupancies.
Examples of AEGIS autonomous pointing refinement.
Matthew J. Kolek et al. JACEP 2018;4:
Line Graphs.
Rate of patients with overall lesion visualization and characterization scored good/excellent or poor/fair. Rate of patients with overall lesion visualization.
Search-Based Approaches to Accelerate Deep Learning
UCF-REU in Computer Vision
Graphic Sources By Ms. Martin.
Week 1 Overview - Cecilia La Place
Fig. 2 Visualization of features.
Sequence plot visualizing the development of symptom frequency for the cohort at the individual level between 2006 and Sequence plot visualizing.
Presentation transcript:

Mnih, Volodymyr, et al. "Human-level control through deep reinforcement learning." Nature 518.7540 (2015): 529-533. Experiments by: Ashish Budhiraja Course: Advanced Computer Vision Instructor: Jia-Bin Huang

Qlearning

Videos at different Learning Rate (Breakout) 77 epochs 200 epochs

Videos at different Learning Rate (Pong) 141 epochs 200 epochs

Videos at different Learning Rate (Seaquest) 178 epochs 200 epochs

Feature Visualization file:///home/ashishkb/RL_ACV/neon/simple_dqn/results/breakout.html

Breakout Graphs

Space Invaders Graphs

Credits: Prof. FRED G. MARTIN http://www.cs.uml.edu/ecg/index.php/AIfall16/PS3b Mnih, Volodymyr, et al. "Human-level control through deep reinforcement learning." Nature 518.7540 (2015): 529-533. https://github.com/devsisters/DQN-tensorflow https://github.com/tambetm/simple_dqn