Stubbs Lab Bioinformatics - 3 Review RNA-Seq Analysis Overview Alignment using Tophat2 Nov 22, 2016 Joe Troy.

Presentation transcript:

Stubbs Lab Bioinformatics - 3 Review RNA-Seq Analysis Overview Alignment using Tophat2 Nov 22, 2016 Joe Troy

Agenda
  Review of tools and Linux commands
  Overview of the RNA-Seq analysis
  Aligning short reads (.fastq files) with Tophat2 to create alignment files (accepted_hits.bam)

Also, to create bigwigs for UCSC track hubs, we use some UCSC software.

Linux commands (review and new)

cp      copy.
        copy file ex: cp oldfile.txt newfile.txt
        copy folder ex: cp -R old_folder new_folder
df -h   see how much disk space is available on the server.
cd      change to a new folder. ex: cd my_new_folder
pwd     print working directory; shows the current folder.
ls -lh  list contents with details (l), showing file size and date in human-readable form (h).
rm      PERMANENTLY remove a file or folder.
        ex: rm my_file.txt removes a file named “my_file.txt” in the current working directory.
        ex: rm -r myfolder removes a folder named “myfolder”, and all of its contents, in the current working directory.
        ex: rm *.txt removes all files ending with ‘.txt’.
        ex: rm * removes every file in the current working directory. BE CAREFUL.
screen  lets you start a “sub-process” on stubbslab.igb.illinois.edu, exit that sub-process while it continues to run (allowing you to disconnect from stubbslab.igb.illinois.edu), and reattach to the process at a later time.
sh      used to start a shell script. ex: sh main_script_tophat_16Gso.sh
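The file commands above can be practiced safely in a throwaway folder before using them on real data. The sketch below is only an illustration: the scratch folder created with mktemp is an assumption and is not part of the slides, while the file and folder names follow the examples in the command list.

```shell
#!/bin/sh
# Work in a scratch folder so that rm cannot touch real project files.
# (The scratch folder is an assumption made for this sketch.)
workdir=$(mktemp -d)
cd "$workdir" || exit 1

echo "example" > oldfile.txt
cp oldfile.txt newfile.txt       # copy a file
mkdir old_folder
cp oldfile.txt old_folder/
cp -R old_folder new_folder      # copy a folder and everything in it

pwd                              # show the current folder
ls -lh                           # list contents with sizes and dates

ls *.txt                         # preview what the pattern matches BEFORE removing
rm *.txt                         # removes only the .txt files in the current folder
rm -r old_folder                 # removes the folder and all of its contents

remaining=$(ls)                  # whatever is left in the scratch folder
echo "$remaining"
```

Running ls with the same pattern first is a cheap habit that shows exactly which files rm would delete.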

RNA-Seq data analysis Context and Overview

Retrieve and un-compress short read files (sftp command, tar command)
  INPUT: .tgz file(s) from ftp.biotec.illinois.edu
  OUTPUT: .fastq short read files
Align reads to genome (Tophat 2 script)
  INPUT: .fastq short read files
  OUTPUT: “accepted_hits.bam” file from each “.fastq” file
Next step: review alignment stats
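The un-compress step in this overview can be sketched with tar alone. This is a minimal illustration, not the lab's actual procedure: the archive name sample_reads.tgz and the single tiny read are hypothetical stand-ins built on the spot, so the sketch runs without access to the ftp server. The sftp download and the Tophat 2 step are not shown.

```shell
#!/bin/sh
# Scratch folder for the demonstration (an assumption made for this sketch).
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Build a stand-in for a downloaded archive.
# sample_reads.tgz and sample_1.fastq are hypothetical names.
mkdir to_pack
printf '@read1\nACGT\n+\nIIII\n' > to_pack/sample_1.fastq
tar -czf sample_reads.tgz -C to_pack sample_1.fastq
rm -r to_pack

tar -tzf sample_reads.tgz                  # list the archive contents without extracting
mkdir fastq_files
tar -xzf sample_reads.tgz -C fastq_files   # un-compress into the fastq_files folder
ls -lh fastq_files

# Count the extracted .fastq files (tr strips padding that some wc versions add).
n_fastq=$(ls fastq_files/*.fastq | wc -l | tr -d ' ')
echo "extracted $n_fastq fastq file(s)"
```

Listing the archive with -t before extracting shows whether the files will land in the current folder or inside a sub-folder.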

Terminal is used to access the Linux command line on a Mac

Instructions to align short reads with tophat2
INSTRUCTION SLIDE 1

Josephs-MacBook-Pro:~ josephtroy$ ssh jmtroy2@stubbslab.igb.illinois.edu
jmtroy2@stubbslab.igb.illinois.edu's password:
Last login: Mon Nov 21 20:15:51 2016 from c-73-73-226-74.hsd1.il.comcast.net
[jmtroy2@stubbslab ~]$ df -h
Filesystem      Size  Used Avail Use% Mounted on
/dev/sda1       4.6T  4.2T  156G  97% /
/dev/sda2        95G   14G   77G  16% /var
/dev/sdb1       289M   29M  246M  11% /boot
tmpfs            32G     0   32G   0% /dev/shm
/dev/sdb2       275G  116G  145G  45% /var/lib/mysql
[jmtroy2@stubbslab ~]$ screen

Instructions to align short reads with tophat2
INSTRUCTION SLIDE 2

[jmtroy2@stubbslab ~]$ cd /home/share/example_rna_seq_project_16Gso/
[jmtroy2@stubbslab example_rna_seq_project_16Gso]$ ls -1
code_010_tophat2
code_020_alignment_summary_report
code_030_MDS_plots
code_040_create_track_hub_bigwigs
code_050_cpm_means_report
code_060_differential_expression_w_edgeR
fastq_files
output_010_tophat2_RUN_20161121_092530
[jmtroy2@stubbslab example_rna_seq_project_16Gso]$ cd code_010_tophat2/
[jmtroy2@stubbslab code_010_tophat2]$ ls
main_script_tophat_16Gso.sh
[jmtroy2@stubbslab code_010_tophat2]$ sh main_script_tophat_16Gso.sh
Start of Tophat …

NOW HOLD DOWN THE CONTROL KEY AND PRESS a, THEN PRESS d, TO DETACH FROM THE SCREEN SESSION

Instructions to align short reads with tophat2
DEMONSTRATION SLIDE 3

[jmtroy2@stubbslab ~]$ screen -ls
There is a screen on:
        11559.pts-2.stubbslab   (Detached)
1 Socket in /var/run/screen/S-jmtroy2.
[jmtroy2@stubbslab ~]$ screen -r 11559
[end of tophat]
[jmtroy2@stubbslab code_010_tophat2]$ exit
[end of tophat]
[jmtroy2@stubbslab code_010_tophat2]$ screen -ls
No Sockets found in /var/run/screen/S-jmtroy2.

Review tophat2 output in Cyberduck

align_summary.txt
NOTE: The “Mapped” rate of 99.9% is unusually high because of the way the example fastq files were created for the training exercise: they contain only reads that had already been mapped to chromosome 5.
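The “Mapped” rate can also be read from the command line instead of Cyberduck. The sketch below is hedged throughout: it writes its own stand-in align_summary.txt, whose layout is an assumption about how a single-end Tophat2 summary is typically formatted, and whose read counts are made up for illustration. Only the 99.9% figure comes from the slide. On the server, the same grep and awk lines would be pointed at the real align_summary.txt in the tophat2 output folder.

```shell
#!/bin/sh
# Scratch folder for the demonstration (an assumption made for this sketch).
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Illustrative stand-in file. The layout is assumed and the counts are invented;
# the real file is written by Tophat2 into its output folder.
cat > align_summary.txt <<'EOF'
Reads:
          Input     :     1000
           Mapped   :      999 (99.9% of input)
            of these:       10 ( 1.0%) have multiple alignments (0 have >20)
99.9% overall read mapping rate.
EOF

# Show the line that reports how many reads were mapped.
grep 'Mapped' align_summary.txt

# Keep only the percentage: the 4th whitespace-separated field on that line
# is "(99.9%", so strip the "(" and "%" characters from it.
mapped_pct=$(awk '/Mapped/ { gsub(/[(%]/, "", $4); print $4 }' align_summary.txt)
echo "Mapped rate: ${mapped_pct}%"
```

If the real file is laid out differently, the field number in the awk line is the part that would need adjusting.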

/home/share/example_rna_seq_project_16Gso/code_010_tophat2/main_script_tophat_16Gso.sh (1 of 2)

/home/share/example_rna_seq_project_16Gso/code_010_tophat2/main_script_tophat_16Gso.sh (2 of 2)