Difference between revisions of "DiffKAP"

Revision as of 00:59, 2 December 2011

We have developed a Differential kmer Analysis Pipeline (DiffKAP) for the pairwise comparison of RNA profiles between metatranscriptomes which does not rely on mapping to reference assemblies. By reducing each read to component kmers and assessing the frequency of these sequences, we overcome statistical limitations on the lack of identical reads for pairwise comparison between samples and allow inference of differential gene expression for annotated reads.

The DiffKAP application consists of a series of scripts written in Perl and Linux shell scripts and requires Jellyfish [Marcais 2011] and BLASTx as well as access to a copy of the SwissProt database. The scripts are freely available for non-commercial use.

What does DiffKAP depend on?

DiffKAP depends on the following things:

Jellyfish for fast kmer counting
blastx for sequence alignment
Some non-standard Perl modules:
- bioperl
  - Bio::SeqIO
  - Bio::SearchIO
- Parallel::ForkManager
- Statistics::Descriptive
- Config::IniFiles
- GD::Graph::linespoints (for the script identifyKmerSize)

How to install?

Download the DiffKAP package
Uncompress it into:
- a DiffKAP setup file
- a README file
- a VERSION file
- an example data folder containing a small subset of a metatranscriptomic data
read the README
Install the DiffKAP setup script by typing: DiffKAP_setup
*** If you like, you can add the DiffKAP path to $PATH or just use an absolute path for running DiffKAP ***

How to run?

Use the example configuration file in the sample data directory as a template to create a configuration file suitable for your project
Run DiffKAP with your configuration file as an input argument, for example: DiffKAP ~/sampleProj/sampleProj.cfg

Results will be generated in the [OUT_DIR]/results where [OUT_DIR] is defined in the config file.
The processing log is stored in /tmp/DiffKAP.log by default.

Q&A

Reference

Marçais, G. and Kingsford, C. (2011) A fast, lock-free approach for efficient parallel counting of occurrences of k-mers, Bioinformatics, 27, 764-770.

Back to Main_Page

@@ Line 6: / Line 6: @@
 == What does DiffKAP depend on? ==
 DiffKAP depends on the following things:
-* [http://www.cbcb.umd.edu/software/jellyfish jellyfish] for fast kmer counting
+* [http://www.cbcb.umd.edu/software/jellyfish Jellyfish] for fast kmer counting
 * blastx for sequence alignment
 * Some non-standard Perl modules:
@@ Line 15: / Line 15: @@
 ** Statistics::Descriptive
 ** Config::IniFiles
+** GD::Graph::linespoints  (for the script identifyKmerSize)
@@ Line 39: / Line 40: @@
 *
+== Reference ==
+* Marçais, G. and Kingsford, C. (2011) A fast, lock-free approach for efficient parallel counting of occurrences of k-mers, Bioinformatics, 27, 764-770.
 Back to [[Main_Page]]

Difference between revisions of "DiffKAP"

Revision as of 00:59, 2 December 2011

Contents

What does DiffKAP depend on?

How to install?

How to run?

Q&A

Reference

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools