Using GC content to distinguish Phytophthora sequences from tomato sequences.

Slides:



Advertisements
Similar presentations
Connecting to GMT machine via Windows 7. Windows PuTTy GMT on Mac server int-038.geosci.usyd.edu.au To use GMT, you will connect to a Mac server via PuTTy.
Advertisements

Linux Platform  Download the source tar ball from the BLAST source code link  ncbi-blast src.tar.gz  Compilation  cd /BLASTdirectory/c++ ./configure.
Java Programming Working with TextPad. Using TextPad to Work with Java This text editor is designed for working with Java You can download a trial version.
Computers for Beginners. Agenda  Environment  Windows (Vista)  Mac (OS X)  Personalization  User Accounts - Passwords  Selective Startup (Windows)
T T02-04 Histogram (User Selected Classes) Purpose Allows the analyst to analyze quantitative data by summarizing it in sorted format, scattergram.
Installing Windows XP Professional Using Attended Installation Slide 1 of 41Session 2 Ver. 1.0 CompTIA A+ Certification: A Comprehensive Approach for all.
 2004 Prentice Hall, Inc. All rights reserved. Chapter 25 – Perl and CGI (Common Gateway Interface) Outline 25.1 Introduction 25.2 Perl 25.3 String Processing.
What is Blast What/Why Standalone Blast Locating/Downloading Blast Using Blast You need: Your sequence to Blast and the database to search against.
Introduction to MATLAB ENGR 1187 MATLAB 1. Programming In The Real World Programming is a powerful tool for solving problems in every day industry settings.
INTRODUCTION TO OPERATING SYSTEMS. An operating system is a program that controls the overall activity of a computer. Like an orchestra conductor an operating.
Software All parts of the computer people can NOT touch, such as programs, files, documents and any other data.
Dreamweaver – Setting up a Site and Page Layouts Web Design Section 7-2 Part or all of this lesson was adapted from the University of Washington’s “Web.
Introduction to NS2 -Network Simulator- -Prepared by Changyong Jung.
Drexel University Software Engineering Research Group 1 Eclipse for SE101.
Tomcat Setup BCIS 3680 Enterprise Programming. Getting Web Apps to Work  Verify that Tomcat works.  Understand how context works.  Create folders/files.
Launch SpecE8 and React from GSS. You can use the chemical analyses in a GSS data sheet to set up and run SpecE8 and React calculations. Analysis → Launch…
Introduction to Engineering MATLAB – 6 Script Files - 1 Agenda Script files.
CIS 90 - Lesson 15 Lesson Module Status Slides – Properties - Flash cards – No-stress quiz – Web calendar summary – Web book pages – Commands – Lab – done.
Session 2 Wharton Summer Tech Camp Basic Unix. Agenda Cover basic UNIX commands and useful functions.
Temple University Training Acoustic model using Sphinx Train Jaykrishna shukla,Mubin Amehed& cara Santin Department of Electrical and Computer Engineering.
Outline Comparison of Excel and R R Coding Example – RStudio Environment – Getting Help – Enter Data – Calculate Mean – Basic Plots – Save a Coding Script.
Introduction to MATLAB ENGR 1181 MATLAB 1. Opening MATLAB  Students, please open MATLAB now.  CLICK on the shortcut icon → Alternatively, select… start/All.
Configuring IQmol for Windows machines, use version!
Elliott Links Edward M. Kwang President. Elliott Notes, Attributes & Links Notes & Attributes –Expand Elliott database without modification –They are.
How to create and install packages in R Presenter: Roman Jandarov
If you use it, cite it.
Reading and Mapping Aerosol Data From a MODIS HDF4 File
Lec4 (Practical) Computer Security-Safe Use Windows Vista Parental Settings.
Java Programming, Second Edition Appendix A Working with Java SDK 1.4.
Lab 0 / Chapter 0 Windows XP Environment. 2 User Interfaces: A different perspective.
Introduction to R Introductions What is R? RStudio Layout Summary Statistics Your First R Graph 17 September 2014 Sherubtse Training.
Operating Systems Written by: Tim Keyser Georgia CTAE Resource Network 2010.
Shell Interface Shell Interface Functions Data. Graphical Interface Graphical Interface Command-line Interface Command-line Interface Experiments Private.
Bioinformatics for biologists Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University Presented.
Pedro Arce GAMOS primer 1 Exercise 0: installation Open a web browser in Go to GAMOS web page : search for Geant4  Applications.
Remote Access Usages. Remote Desktop Remote desktop technology makes it possible to view another computer's desktop on your computer. This means you can.
Getting your STK license for METC 106 Go to the website and download STK 9.2.3http://
Practical Kinetics Exercise 0: Getting Started Objectives: 1.Install Python and IPython Notebook 2.print “Hello World!”
Module 6 Creating and Configuring Group Policy. Module Overview Overview of Group Policy Configuring the Scope of Group Policy Objects Evaluating the.
Xi Wang Yang Zhang. 1. Easy to learn 2. Clean and readable codes 3. A lot of useful packages, especially for web scraping and text mining 4. Growing popularity.
1 Project 3: Farey Sequences. 2 Farey Sequences Further details about Farey sequences, and more examples, can be found in the Wikipedia article:
Math 252: Math Modeling Eli Goldwyn Introduction to MATLAB.
1 Project 4: Palindrome Detector. 2 Assignment Write a C++ program that reads a line of text from the keyboard and reports to the user whether the text.
Tomcat Setup BCIS 3680 Enterprise Programming. One-Click Tomcat Setup 2  This semester we’ll try to set up Tomcat with a PowerShell script.  Preparation.
Dreamweaver – Setting up a Site and Page Layouts Web Design Section 7-2 Part or all of this lesson was adapted from the University of Washington’s “Web.
Learning Unix/Linux Based on slides from: Eric Bishop.
Optical Flow walk through Aidean Sharghi Spring 14.
 is a set of instructions that tell the computer what to do. Software can be categorized into: 1. Operating system software 2. Applications software.
 CSC 215 : Procedural Programming with C C Compilers.
GETTING STARTED WITH AWS AND PYTHON. OUTLINE  Intro to Boto  Installation and configuration  Working with AWS S3 using Bot  Working with AWS SQS using.
Subversion Subversion is a brand of version control software that is frequently used to store the code and documentation of a project so as to permit.
CSC 215 : Procedural Programming with C
Feature Engineering Studio
Moodle: Tune up your Moodle Installation
Stubbs Lab Bioinformatics - 2 Retrieving sequence data files and Linux commands Nov 17, 2016 Joe Troy.
Navigating the Filing System
Andy Wang Object Oriented Programming in C++ COP 3330
Perl A simple test.
FTP and UNIX TOPICS Exploring your Web Hosting Site FTP UNIX
Content: What is JMeter? What can I do with JMeter?
Working with the Command-Line Interfaced
At Software Download Help, you will get to know about all the processes from downloading to installation at just one place without wasting even a bit of.
Modification of the bioperl script for parsing BLAST output
IOTA HOW TO START BUILDING.
Code is on the Website Outline Comparison of Excel and R
Andy Wang Object Oriented Programming in C++ COP 3330
Yung-Hsiang Lu Purdue University
Using Script Files and Managing Data
L L Line CSE 420 Computer Games Lecture #3 Introduction to Python.
The Python interpreter
Presentation transcript:

Using GC content to distinguish Phytophthora sequences from tomato sequences

Mission #1 Calculate the GC content of each sequence in the Phytophthora-tomato interactome We will use a perl script to accomplish the mission.

Preparation Download the perl script (gc.pl) from the class web site and store it in C:/BioDownload folder

Open cygwin, or command prompt (Vista users), or terminal (Mac users) Change directory (cd) to the BioDownload folder perl gc.pl PhytophSeq1.txt phyto_gc.out Running the script

In cygwin (Windows users) or terminal (Mac users) grep --perl-regexp ”\t” -c phytoph_gc.out grep ”>” -c PhytophSeq1.txt You should get the same number from the two commands. The number should be Results

The output file GC content column Name column

Build a histogram of the values of GC content We will use R program to accomplish this mission. Mission #2

Mac users

All Windows users

XP users Vista users

getwd() to know which folder you are in now

setwd(“c:/BioDownload”) to change the working directory to C:/BioDownload setwd(“/path/to/biodownload”) for Mac users

data<-read.table(“phytoph_gc.out”,sep=“\t”,header=FALSE) to read in the data in the file phytoph_gc.out (your file name may be different)

data[1:10,] to see the first 10 lines of the vector “data”

gc<-data[,2] to assign the values from the 2 nd column of “data” to a new vector “gc”

summary(gc) to get the summary of the values in the vector “gc”

hist(gc,breaks=58) to draw a histogram of the values in “gc” vector Breaks indicates how many cells you want for the histogram. It was calculated as 78.7 (max) (min). It means the bin of the histogram is ~ 1 GC value

hist(gc,breaks=58,xlab=“GC content”,ylim=range(c(0,400)),main=“Histogram of GC content of sequences\ninPhytophthora-tomato interactome”) to make the histogram look better

>pdf(“gc_histogram.pdf”) >hist(gc,breaks=58,xlab=“GC content”,ylim=range(c(0,400)),main=“Histogram of GC content of sequences\ninPhytophthora-tomato interactome”) >dev.off() To output the histogram to a PDF file.

location file