GitHub - CBB752Spring2016/CBB752_Final_Project_1.2: Tool that generates “quality control statistics” from FastQ file

README for Quality Statistics

Python tool that generates quality control statistics from FastQ file. A tool that accomplishes this task in the language R can be found [here] (https://github.com/dspak/CBB752_Final_Project_1.2). This tool is part of a set of bioinformatic and biological structure tools created for CBB752 at Yale University in the Spring 2016. The website containing the set of tools can be found [here] (http://cbb752spring2016.github.io).

General

The tool is named qualitystats.py. It takes one required input (name of the fastq file to be processed) and one optional input (the name of the txt file to which the filename and titles of the corresponding plots are output). This tool creates png files of the following plots:

Distribution of Read Lengths
Per base quality score containing the median, quartiles, mean, and standard deviation
Distribution of mean quality per sequence

Usage

Usage: python3 qualitystats.py -i < input file > -o < output file >

Examples:

# Usage from terminal:
python3 qualitystats.py -i sample-input.fastq -o sample-output.txt
python3 qualitystats.py -i sample-input.fastq

Input and Output formats

Input Formats:

input (-i): string of corresponding fastq file
output (-o): string containing the name of the file to which the output information is saved

Output Format: txt file containing the file name, the number of sequences and the titles of corresponding plots

Sample Output

Quality Score Statistics and Figure names for sample-input.fastq

Number of Sequences = 100000

For the Distribution of read lengths see: Sequence_Length_Distribution.png

For a plot of the per base quality score see: Per_Base_Sequence_Quality.png

For a plot of the distribution of mean quality per sequence see: Per_Sequence_Mean_Quality_Distribution.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

README for Quality Statistics

General

Usage

Usage: python3 qualitystats.py -i < input file > -o < output file >

Examples:

Input and Output formats

Sample Output

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
Per_Base_Sequence_Quality.png		Per_Base_Sequence_Quality.png
Per_Sequence_Mean_Quality_Distribution.png		Per_Sequence_Mean_Quality_Distribution.png
README.md		README.md
Sequence_Length_Distribution.png		Sequence_Length_Distribution.png
qualitystats.py		qualitystats.py
sample-input.fastq		sample-input.fastq
sample-input_1K.fastq		sample-input_1K.fastq
sample-output.txt		sample-output.txt

CBB752Spring2016/CBB752_Final_Project_1.2

Folders and files

Latest commit

History

Repository files navigation

README for Quality Statistics

General

Usage

Usage: python3 qualitystats.py -i < input file > -o < output file >

Examples:

Input and Output formats

Sample Output

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages