Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, 18-24 September 2008. Bioinformatics Primer.

Slides:



Advertisements
Similar presentations
Introduction to Linux command line for bioinformatics Wenjun Kang, MS Jorge Andrade, PhD 6/28/2013 Bioinformatics Core, Center.
Advertisements

Linux commands exercise 1. What do you need, if you try to these at home? You need to download and install Ubuntu Linux from the Internet – DVD is need.
Linux, it's not Windows A short introduction to the sub-department's computer systems Gareth Thomas.
Learning Unix/Linux Bioinformatics Orientation 2008 Eric Bishop.
ENEE150: Discussion 1 Section 0104 Please Sit Down at a Computer and Login!
Cosc 4750 Getting Started in UNIX Don’t be afraid of the prompt, in linux it can be your best friend. In some cases, the only way to do certain things.
© Janice Regan, CMPT 102, Sept CMPT 102 Introduction to Scientific Computer Programming Preparation for working in the CSIL Basic LINUX operations.
Introduction to Linux Alan Orth April 17, 2010 ILRI, Nairobi.
Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Introduction to examination.
It's a binary file kept under specific directory.
ATS Programming Short Course I INTRODUCTORY CONCEPTS Tuesday, Jan. 27 th, 2009 Essential Unix Commands.
Introducing the Command Line CMSC 121 Introduction to UNIX Much of the material in these slides was taken from Dan Hood’s CMSC 121 Lecture Notes.
Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September 2008 Elisabet Andersson, Alistair Chalk Stem Cell Biology and Bioinformatic.
Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Chip-chip and handling.
Linux+ Guide to Linux Certification, Second Edition
T UTORIAL OF U NIX C OMMAND & SHELL SCRIPT S 5027 Professor: Dr. Shu-Ching Chen TA: Samira Pouyanfar Spring 2015.
1 Basics of Linux On linux machine: Login at your home directory Open a “shell” or “terminal” or “xterm” workspace (4) On windows machine Intall linux.
Very Quick & Basic Unix Steven Newhouse Unix is user-friendly. It's just very selective about who its friends are.
UNIX Command-line Introduction Terence Parr. Navigating  cd  pwd  ls  pushd/pod  cd  pwd  ls  pushd/pod.
Virtual Machine and UNIX. What is a VM? VM stands for Virtual Machine. It is a software emulation of hardware. By using a VM, you can have the same hardware.
Shell Scripting Basics Arun Sethuraman. What’s a shell? Command line interpreter for Unix Bourne (sh), Bourne-again (bash), C shell (csh, tcsh), etc Handful.
1 SEEM3460 Tutorial Unix Introduction. 2 Introduction What is Unix? An operation system (OS), similar to Windows, MacOS X Why learn Unix? Greatest Software.
CS 141 Labs are mandatory. Attendance will be taken in each lab. Make account on moodle. Projects will be submitted via moodle.
Using Macs and Unix Nancy Griffeth January 6, 2014 Funding for this workshop was provided by the program “Computational Modeling and Analysis of Complex.
Introduction to UNIX/Linux Exercises Dan Stanzione.
MCB Lecture #3 Sept 2/14 Intro to UNIX terminal.
Applied Bioinformatics Course Overview & Introduction to Linux Bing Zhang Department of Biomedical Informatics Vanderbilt University
Unix Primer. Unix Shell The shell is a command programming language that provides an interface to the UNIX operating system. The shell is a “regular”
Linux environment ● Graphical interface – X-window + window manager ● Text interface – terminal + shell.
Dedan Githae, BecA-ILRI Hub Introduction to Linux / UNIX OS MARI eBioKit Workshop; Nov , 2014.
A few Linux basics Network Monitoring & Management.
1 SEEM3460 Tutorial Unix Introduction. 2 Introduction Unix-like system is everywhere Linux Android for smartphones Google Chrome OS for Chromebook Web.
PROGRAMMING PROJECT POLICIES AND UNIX INTRO Sal LaMarca CSCI 1302, Fall 2009.
Session 2 Wharton Summer Tech Camp Basic Unix. Agenda Cover basic UNIX commands and useful functions.
BIF713 Basic Unix/Linux Commands Getting Help with Commands.
November 5, 2015CIS 118 Intro to UNIX What’s a Unix? Unix is an Operating System Designed for high-traffic usage Many variants –Linux, BSD: free versions.
1 Remote Access Telnet Telnet FTP FTP. 2 Applications and Communications Telnet Telnet  Program for accessing systems remotely.  Available on Windows.
Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Day 5-2 What bioinformatics.
1May 16, 2005 Week 2 Lab Agenda Command Line FTP Commands Review More UNIX commands to learn File name expansion - * Introduction of vi.
CS 245 – Part 1 Using Operating Systems and Networks for Programmers Jiang Guo Dept. of Computer Science California State University Los Angeles.
Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Day 1 – 3 Introduction.
ENEE150: Discussion 1 Section 0104/0105 Please Sit Down at a Computer and Login!
AN INTRO TO UNIX/LINUX COMMANDS BY: JIAYANG WANG.
1 Introduction to Unix. 2 What is UNIX?  UNIX is an Operating System (OS).  An operating system is a control program that helps the user communicate.
 Last lesson, the Windows Operating System was discussed along with the Windows command shell  Unix is a computer operating system, that similarly manages.
General Computer Science for Engineers CISC 106 Lecture 03 James Atlas Computer and Information Sciences 6/15/2009.
Learning Unix/Linux Based on slides from: Eric Bishop.
General Computer Science for Engineers CISC 106 Lecture 03 James Atlas Computer and Information Sciences 9/9/2009.
1 CSE 391 Lecture 10 Do anything from anywhere – tools to free your choice of OS slides created by Marty Stepp, modified by Jessica Miller and Ruth Anderson.
Assignprelim.1 Assignment Preliminaries © 2012 B. Wilkinson/Clayton Ferner. Modification date: Jan 16a, 2014.
Review Why do we use protection levels? Why do we use constructors?
Tutorial Six Linux Basics CompSci Semester Two 2016.
Hackinars in Bioinformatics
Precept I : Lab Environment, Unix, Bash, Emacs
UNIX To do work for the class, you will be using the Unix operating system. Once connected to the system, you will be presented with a login screen. Once.
How to Work on your METEO 473 codes from your PC at home
Tutorial of Unix Command & shell scriptS 5027
Getting started with CentOS Linux
Andy Wang Object Oriented Programming in C++ COP 3330
Some Linux Commands.
Assignment Preliminaries
Tutorial of Unix Command & shell scriptS 5027
Tutorial of Unix Commands
Tutorial of Unix Command & shell scriptS 5027
Tutorial of Unix Command & shell scriptS 5027
Getting started with CentOS Linux
Andy Wang Object Oriented Programming in C++ COP 3330
UNIX/LINUX Commands Using BASH Copyright © 2017 – Curt Hill.
Tutorial Unix Command & Makefile CIS 5027
Short Read Sequencing Analysis Workshop
Presentation transcript:

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Bioinformatics Primer Goal: Introductory skills for bioinformatics analysis. Format: Complete the exercises, ask anything.

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Basic Skills – interpro interpro ( Exercise –for 3 proteins important to your research area (choose 2 well defined, 1 not well defined)‏ –download their protein sequence from –analyse them using interpro what domains do they contain? what are the functions of these domains? what families do the proteins belong to? –how would you do this on 100 proteins, or 20,000 proteins?

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Basic Skills – gene ontologies Gene Ontology database – Exercise –Keep this information saved as you will use it thje following days –1) Define Molecular function Biological process Subcellular location –2) Find GO identifiers that describe functions, processes or locations that are relevant to your research List the identifier, type and description. Should you use identifiers further up or down the hierarchy?

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Basic Skills – gene ontologies Exercise continued –3) For 3 proteins relevant to your research What GO terms are assigned to the protein? What evidence is there for the assignments? –4) Describe the difference between the evidence codes. –5) How would you find all proteins with a specific molecular function?

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Basic Skills – ArrayExpress/GEO GEO/ArrayExpress –Microarray repository tools containing published microarray data –Note differences in ease of use and completeness! Exercise –Compare GEO and ArrayExpress. –Search for Human stem cell microarray studies –What are the GEO/ArrayExpress identifiers for some recent Stem cell microarray studies? –What data is available? Raw data? Processed data? –Download a CEL file (or set of CEL files) from a stem cell microarray study. –Go to ArrayExpress Atlas Look up at least two genes of interest (in stem cell biology)‏ What does the database tell you?

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Basic Skills – Ensembl Exercise –Go to Ensembl. Describe it. –Look up a (human) gene. How many transcript variants does it have? –Explore! –Use BioMart to gather all Ensembl identifiers and Entrez geneIDs for all human and mouse genes, export this data into excel (you will need this later).

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Basic Skills – UCSC Exercise –Go to genome.ucsc.edu. –Look up a (human) gene. Select many different gene models – how many transcript variants are found for your gene in UCSC known genes, AceView, Refseq? –Use the table browser to download all human genes (refseq) into excel. –What else of interest can you download?

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Basic Skills – R See accompanying worksheet

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Basic Skills – command line login Try this on your own laptop Windows command line –windows+R, type “cmd” Cygwin (unix in windows)‏ –open cygwin putty (log into a unix server)‏ –ip address, username, password VMware (virtual machine within windows)‏ –choose a unix virtual machine (i.e. tinyunix)‏ –open a terminal Apple Mac –OS X: open a terminal

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Basic Skills – command line Basic command line operations –Directories cd : Change the current directory pwd : get current working directory –Viewing files and directories ls : list the contents of a directory (dir)‏ more : see contents of file on screen, stop after every page less : see contents of file (with better ability to move in the file)‏ cat : see contents of file, don't stop at new page

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Basic Skills – command line Basic command line operations –Editing files emacs : open file for editing in emacs –other programs: nedit, vi –Copying and moving files cp : copy file to destination (copy)‏ mv : move (or rename) file to destination (move)‏

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Basic Skills – command line Basic command line operations –login and copying ssh / scp : login to server, copy files –viewing parts of files head -#lines : look at first # lines tail -#lines : look at last # lines –pattern matching grep -e “pattern” : find lines in file with “pattern” grep -v “pattern” : find lines in file without “pattern” –counting wc : count words, lines in file

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Basic Skills – command line Basic command line operations –> : send results to file more filename > filename2 (send all of filename to filename2)‏ ls > directory_contents.txt –pipe :”|” : send the results forward to another program grep -e “pattern” filename > filename_pattern.txt head -5 filename > filename_pattern.txt

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September command line exercises Create a directory, name it after yourself What is the current working directory? Copy “exercise.txt” into your directory Change the working directory to that directory Look at the file with “more” Read the man page for wc with “man wc” What are the first 5 lines? What are the last 3 lines? How many lines contain the word “fish”? (hint you need to use pipe)‏

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September command line exercises command line in windows –Test the following in windows command line (open with windows-key + R, then “cmd”)‏ more | (pipe)‏ grep wc sed –Which work, which do not? –How do you find help for a program ​ ? –What is “sed” for?

Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Additional resources Plenty of tutorials are available online for R and unix –Unix tutorial for beginers –R –Note some are very large (100+ pages)‏