KisSplice

A local transcriptome assembler for SNPs, indels and AS events

Downloading KisSplice imposes the full acceptance of the CeCILL license.

Requirements

KisSplice is written in C++ and Python. It works on Linux and Mac 64b systems. It also requires Python 2.7.x

Version 2.4.0-p1

Kissplice Version 2.4.0-p1 (2016-04-14)

  • Bugs fixed
    • Fixed a bug that happened if the user uses the --min_overlap parameter.

Version 2.4.0

Kissplice Version 2.4.0 (2016-01-08)

  • Improvements
    • Added a new experimental algorithm that searches for bubbles by listing all paths. The time and memory that may be used by this new algorithm is not bounded by a polynomial function on the input size, but it worked well on some instances in practice. To use this algorithm instead of the default one, it is needed to call KisSplice with parameter --experimental (if you want to limit the amount of memory for this algorithm, see --max-memory parameter);
    • Improved the way KisSplice classifies bubbles: 1) indels (Type 3) now have a shorter path length of at maximum 2k; 2) now the maximum 10% hamming distance to classify a bubble as Type_0b considers only the variable part of the upper path;
    • Bubbles are now output before and after read coherency module (enabling the users to access them before this step, which is sometimes the bottleneck of the pipeline);
    • Added a timestamp to the main messages of the pipeline and a summary log file in the results folder.
  • Parameters
    • Modified the -s parameter. -s now should be always followed by an integer and it changes which types of SNPs will be output. If 0 (default), will not output SNPs. If 1, will output Type0a-SNPs. If 2, will output Type0a and Type0b SNPs;
    • -m (LL_MIN) from 2k-6 to 2k-8. So, more bubbles will be found now;
    • -M (UL_MAX) from 10000 to 1000000. This enables KisSplice to find very long and rare events, and it does not impact much in runtime;
    • --timeout now defaults to 100000s;
    • -C from 0.02 to 0.05;
    • Added --keep-bccs parameter which indicates if KisSplice should keep the node/edges files for all BCCs after execution;
    • Added --output-branch-count parameter which indicates if KisSplice should output the number of branching nodes in each path;
    • Added --keep-counts parameter which indicates if KisSplice should keep the .counts file after the sequencing-errors-removal step.
  • Bugs fixed
    • Fixed minor bugs in the compression steps. This caused some rare bubbles to be missed;
    • Fixed some bugs related to the read coherence and coverage step. This may change the counts of the events and their classification whether coherent or uncoherent.

Version 2.3.1

Kissplice Version 2.3.1 (2014-12-18)

  • Improvements
    • KissReads module has been modified and sped up, with a significant impact on run times.
  • Parameters
    • --timeout default now at 10000: in big datasets, recall can be increased while run time is a bit longer.
  • Bugs fixed
    • Reads containing only 'N': the graph construction was stopped if the file contained a read composed only of 'N's. This is was a silence bug, no error message was produced.
    • Problems compiling with new versions of MAC OSX (10.8+): KisSplice is now compiling with the new default C++ compiler of OSX 10.8+.

Version 2.2.1

Kissplice Version 2.2.1 (2014-07-24)

  • Bug fixed : when a fasta/fastq file contains empty lines, events might have been misclassed (into 'incoherent') by KissReads. With this version these lines are ignored, thus KisSplice results will change on datasets with empty lines, with more events classified into types 0,1,2,3 and less into incoherent.
  • This version also integrates a correction that affects only the compilation and the installation of KisSplice in some cases and does not have impact on KiSplice results.

Version 2.2.0

Kissplice Version 2.2.0 (2014-06-12)

  • New enumeration algorithm, dealing with highly complex regions in the graph due to repeats.

    Sensitivity is increased : the new enumeration enables to find more events (it has been validated on simulated data, see Navigating in a sea of repeats in RNA-seq without drowning), as the complex part created by repeats may represent half of the size of the graph and was not accessible before. Specificty is also slightly increased.

  • Options and parameters :
    • Note that KisSplice no longer outputs SNPs by default. The flag (-s) outputs SNPs.
    • The maximum number of cycles (max_cycles) increased from 10k to 100M, the timeout increased from 900s to 3600s, which fits better to the size standards of RNA-seq datasets.

Version 2.1.0

Kissplice Version 2.1.0 (2014-01-29)
New performances, results and options :

  • Outputs :
    • The command line is returned in the output (.o) file.
    • Type_0 results file now presents two types of variants :
      • Type_0a where only one single nucleotide change is returned in the upper path
      • Type_0b where several single nucleotide changes are returned in the upper path
    In case this "type 0" variant would be a SNP, we hope it will help users in their SNP characterization.
  • Pipeline features :
    • 2.1 version is able to catch variants that were not found in last versions, so we strongly incitate users to run again their former jobs as they may end up with more results.
    • "N" nucleotides are now ignored by the pipeline during graph building, enabling to work efficiently with masked repeats in data for instance.
    • Graph building and storing improved.
  • Options and parameters :
    • Default parameters have changed : k moves from 25 to 41 and M from 1000 to 10000. Be aware that this will change the outputs. With a larger value of k, it is expected that the specifity of KisSplice will increase (all repeats smaller than k are solved), while the sensitivity will decrease (variants poorly expressed may be not fully assembled).
    • New option --mismatches makes possible to allow some mismatches while KissReads' reads alignment on results.
    • Unfinished bcc stored by default.

Version 2.0.0

Kissplice Version 2.0.0 (2013-11-18)
This version comes with important changes :

Version 1.8.3

KisSplice version 1.8.3-p1 (2013-10-08, patched version)

  • Bug fix: it is now possible to compile for k > 31

KisSplice version 1.8.3 (2013-06-24)

  • New build organization (no feature changes)

Version 1.8.2

KisSplice version 1.8.2 (2013-06-24 )

  • KisSplice 1.8.2 handles compressed reads (.gz)
  • Events quantification is more stable

Version 1.8.1

KisSplice version 1.8.1 (2013-04-05 )

  • KisSplice 1.8 can be compiled for k > 31
  • Dynamic error removal is now done by default

Version 1.8.0

KisSplice version 1.8.0 (2013-03-25)

  • De-Bruijn Graph construction is done with minia which uses much less memory (5 Gb for 100M reads). KisSplice can now be applied to very large projects (up to 1G reads)
  • Dynamic error removal : a pre-processing step to remove edges with low relative coverage, which are likely sequencing errors ( option -C )
  • Uniform criteria to stop the enumeration: results can be different from version 1.7.1

Version 1.7.1

kisSplice version 1.7.1 (2012-12-01 )