Command Line For windows an “ok” ssh program is putty.

Slides:



Advertisements
Similar presentations
Learning Unix/Linux Bioinformatics Orientation 2008 Eric Bishop.
Advertisements

Dayu Zhang 9/8/2014 Lab02. Example of Commands pwd --- show your current directory This is home of venus, not your home directory Tilde: means you are.
"Nothing in biology makes sense except in the light of evolution" Theodosius Dobzhansky:
MCB 371/372 Sequence alignment 3/30/05 Peter Gogarten Office: BSP 404 phone: ,
MCB 372 BLAST, unix, Perl continued J. Peter Gogarten Office: BPB 404 phone: ,
"Nothing in biology makes sense except in the light of evolution" Theodosius Dobzhansky:
T UTORIAL OF U NIX C OMMAND & SHELL SCRIPT S 5027 Professor: Dr. Shu-Ching Chen TA: Samira Pouyanfar Spring 2015.
Engineering H192 - Computer Programming The Ohio State University Gateway Engineering Education Coalition Lect 4P. 1Winter Quarter Introduction to UNIX.
Shell Scripting Basics Arun Sethuraman. What’s a shell? Command line interpreter for Unix Bourne (sh), Bourne-again (bash), C shell (csh, tcsh), etc Handful.
Using Macs and Unix Nancy Griffeth January 6, 2014 Funding for this workshop was provided by the program “Computational Modeling and Analysis of Complex.
MCB Lecture #3 Sept 2/14 Intro to UNIX terminal.
Overview of Linux CS3530 Spring 2014 Dr. José M. Garrido Department of Computer Science.
Help session: Unix basics Keith 9/9/2011. Login in Unix lab  User name: ug0xx Password: ece321 (initial)  The password will not be displayed on the.
1 Intro to Linux - getting around HPC systems Himanshu Chhetri.
Essential Unix at ACEnet Joey Bernard, Computational Research Consultant.
Unix Basics Chapter 4.
CENT 305 Information Systems Security Linux Introduction.
Basic unix commands that everyone should know (Even if you have a mac) Slightly more advanced:
ECT 250: Survey of E-Commerce Technology FrontPage Publishing pages Unix.
Unix Tutorial for FreeSurfer Users. Helpful To Know FreeSurfer Tutorial Wiki:
PROGRAMMING PROJECT POLICIES AND UNIX INTRO Sal LaMarca CSCI 1302, Fall 2009.
Unix Tutorial for FreeSurfer Users. Helpful To Know FreeSurfer Tutorial Wiki:
Founded in 1982 at the Los Alamos National Laboratory Initially managed at Stanford in conjunction with the BIOSCI/Bionet news groups transition.
Operating Systems and Using Linux CMSC 104, Lecture 3 John Y. Park 1.
1 Operating Systems and Using Linux Topics What is an Operating System? Linux Overview Frequently Used Linux Commands Reading None.
A Genomics View of Unix. General Unix Tips To use the command line start X11 and type commands into the “xterm” window A few things about unix commands:
Unix Commands PowerPoint Presentation developed for LS 560 Information Technology online class - University of Alabama by Debey Sklenar TENacious Cohort.
"Nothing in biology makes sense except in the light of evolution" Theodosius Dobzhansky:
Unix Servers Used in This Class  Two Unix servers set up in CS department will be used for some programming projects  Machine name: eustis.eecs.ucf.edu.
1 Introduction to Unix. 2 What is UNIX?  UNIX is an Operating System (OS).  An operating system is a control program that helps the user communicate.
Object Oriented Programming COP3330 / CGS5409.  Compiling with g++  Using Makefiles  Debugging.
Learning Unix/Linux Based on slides from: Eric Bishop.
Object Oriented Programming COP3330 / CGS5409.  Assignment Submission Overview  Compiling with g++  Using Makefiles  Misc. Review.
CMSC 104, Version 9/011 Operating Systems and Using Linux Topics What is an Operating System? Linux Overview Frequently Used Linux Commands Reading None.
Overview of Linux Fall 2016 Dr. Donghyun Kim
Introduction to Unix for FreeSurfer Users
UNIX To do work for the class, you will be using the Unix operating system. Once connected to the system, you will be presented with a login screen. Once.
First Day in Lab Making a C++ program
Tutorial of Unix Command & shell scriptS 5027
Commands Basic syntax of shell commands UNIX or shell commands have a basic structure command -options target command comes first (such as cd or ls) any.
Web Programming Essentials:
"Nothing in biology makes sense except in the light of evolution"
Command Line For windows an “ok” ssh program is putty.
Part 3 – Remote Connection, File Transfer, Remote Environments
The Linux Operating System
Assignment Preliminaries
Tutorial of Unix Command & shell scriptS 5027
Tutorial of Unix Command & shell scriptS 5027
Intro to UNIX System and Homework 1
Operating Systems and Using Linux
Operating Systems and Using Linux
Introduction to Linux Week 0 - Thursday.
"Nothing in biology makes sense except in the light of evolution"
Operating Systems and Using Linux
Web Programming Essentials:
Tutorial of Unix Command & shell scriptS 5027
A Brief Overview of Unix
Operating Systems and Using Linux
"Nothing in biology makes sense except in the light of evolution"
Genbank Founded in 1982 at the Los Alamos National Laboratory
"Nothing in biology makes sense except in the light of evolution"
Operating Systems and Using Linux
Operating Systems and Using Linux
CSCI N317 Computation for Scientific Applications Unit 1 – 1 MATLAB
Command line.
Operating Systems and Using Linux
BLAST Slides adapted & edited from a set by
Introduction to Linux and Code Editing
BLAST Slides adapted & edited from a set by
Presentation transcript:

Command Line For windows an “ok” ssh program is putty. The favored operating system flavor in computational biology is UNIX/LINUX. The command line is similar to DOS. Some of the frequently used commands are here pwd ls ls –l chmod chmod a+x blastall.sh chmod 755 *.sh cd cd .. cd $HOME passwd ps ps aux rm more cat vi (text editor) ps ps aux ssh sftp For windows an “ok” ssh program is putty. UConn also has a site license for the ssh program from ssh.com

UNIX Basic UNIX commands 
 ls, cd, chmod, cp, rm, mkdir, more (or) less, vi, ps, kill -9, man A brief listing is here chmod is a particular pain in the ... . Under unix every file has an owner and the owner, his group and everyone else have permissions to read, write and/or execute the file (or they don’t). If you want to see which permissions are currently assigned to your files, type ls -l at the command prompt. chmod a+x *.pl gives everyone execute permission for all files that end with .pl the * is a wildcard. (warning don't ever use rm in conjunction with *) 
 For more on chmod type  ”man chmod“.
 (In the OSX GUI you can control click at a file, and change permissions in the info box). Most ssh clients (FUGU and SSH) allow you to use a GUI to change file permissions (in FUGU ctrl click).

Unix - command line interface If you tried to execute a command, and you made a mistake, for example, you mistyped a file name, you can recall the last command using the up arrow (down arrow for more recent). If you are tired typing long filenames, you can use the tab key to complete the line, provided there is only one way to complete the line. E.g: cd /Desktop could be replaced by cd /D<tab> If there are two or more choices you hear a boing, if you hit <tab> again, you get a list of choices. If you want to become more familiar with the unix command line, the code-academy has a good introduction at https://www.codecademy.com/courses/learn-the-command-line

characters at the end of lines File tranfers from Windows to UNIX and return: End of Line characters are a problem. Under Windows DO NOT use notepad, it does not understand UNIX newline symbols ‘\n’. Best write your programs under UNIX using vi or vim (or any other editor you are comfortable with) 2nd best is to use a text editor like textwrangler (very nice and free program for UNIX). Like vi and vim it provides context dependent coloring. 3rd best is to remove end of line symbols in a UNIX editor or use sed (Stream EDitor) after you transferred the file: 
 sed s/.$// name_of_WINDOWS_infile > name_of_UNIX_outfile 
(This replaces the last non letter character before the eol ($) with nothing) Some versions of office allow to change files as UNIX textfiles, but ... A related problem is encountered by Mac users. Most text editors will use MAC carriage returns at the end of the line. Most unix programs will not be able to handle these. In a terminal window you could use the following command to convert your file:
 tr ’\r' ’\n' < name_of_the_Mac_file > name_of_the_unix_file 
 If you are working in a GUI environment, you also could use the convertNewLines.app program (install it in your application folder, drag the file you want to convert into the icon). The program is available here. The EoL confusion is very inconvenient, but there really is no easy solution, tough luck; and you better know about this in case something goes wrong.

Special characters: \n #newline \t #tab

To move files between local PC and server: For windows machines: install ssh client from ftp://ftp.uconn.edu/restricted/ssh/ For Macintosh computers: install Filezilla (client!) from https://filezilla-project.org/

Example: SSH to bbcsrv3.biotech.uconn.edu qlogin formatdb -i p_abyssi.faa -o T -p T blastall -i t_maritima.faa -d p_abyssi.faa -o blast.out -p blastp -e 10 -m 8 -a2 ./extract_lines.pl blast.out Perl script that only retains the first hit and gets rid of comment lines sftp results load into spreadsheet sort data, do histogram … the extract_lines.pl script is here (you can sftp it into your account, you’ll need to chmod 755 extr*.pl afterwards)

Genbank Founded in 1982 at the Los Alamos National Laboratory Initially managed at Stanford in conjunction with the BIOSCI/Bionet news groups 1989-92 transition to the NCBI on the east coast One precursor was Margaret Dayhoff’s Atlas of Protein Sequence and Structure In 1987 genbank fit onto a few 360 KB floppy disks. Genbank uses a flat file database format (see http://en.wikipedia.org/wiki/Flat_file_database) NCBI does not use a relational databank (as in Oracle, peoplesoft) NCBI stores data in ASN.1 format (http://en.wikipedia.org/wiki/Abstract_Syntax_Notation_One), which allows to hardwire crosslinks to other data bases, and makes retrieval of related information fast. NCBI’s sample record (http://www.ncbi.nlm.nih.gov/Sitemap/samplerecord.html) contains links to most the fields used in the gbk flatfile. In the genbank records at NCBI the links connect to the features (i.e. the pubmed record, or the encoded protein sequence) --- not easy to work with.

Dr. Margaret Belle (Oakley) Dayhoff March 11, 1925 – February 5, 1983 Among other things, we owe her the first nucleotide and protein data bank, the PAM substitution matrix, and the single letter amino acid code. (Image from wikipedia)

Atlas of Protein Sequences 1972 (cont) The Atlas also contained RNA sequences, and PAM matrix for nucleotides

Atlas of Protein Sequences 1972 (cont) Contained phylogenetic reconstructions that went back in time to far before the Last Unversal Common Ancestor (LUCA) aka the cenancestor of all living cellular organisms alive today. tRNA phylogeny

PAM 250 log (odds) matrix Dayhoff recoding

Selecting Scoring Matrices Choose a matrix appropriate to the suspected degree of sequence identity between the query and its target sequences PAM: empirically derived for close relatives BLOSUM: empirically derived for distant relatives Kerfeld and Scott, PLoS Biology 2011 13 8. Teaching Tools