r-seurat-data
|
public |
Single cell RNA sequencing datasets can be large, consisting of matrices that contain expression data for several thousand features across several thousand cells. This package is designed to easily install, manage, and learn about various single-cell datasets, provided Seurat objects and distributed as independent packages.
|
2024-12-14 |
quicktree
|
public |
Fast implementation of the neighbour-joining phylogenetic inference method
|
2024-12-14 |
ucsc-qactoqa
|
public |
convert from compressed to uncompressed
|
2024-12-14 |
ropebwt2
|
public |
Incremental construction of FM-index for DNA sequences
|
2024-12-14 |
enano
|
public |
ENANO is a FASTQ compression tool especially designed for nanopore sequencing FASTQ files.
|
2024-12-14 |
seqfu
|
public |
DNA sequence utilities
|
2024-12-14 |
newick_utils
|
public |
The Newick Utilities are a suite of Unix shell tools for processing phylogenetic trees. We distribute the package under the BSD License. Functions include re-rooting, extracting subtrees, trimming, pruning, condensing, drawing (ASCII graphics or SVG).
|
2024-12-14 |
sgdemux
|
public |
Tool for demultiplexing sequencing data generated on Singular Genomics' sequencing instruments.
|
2024-12-14 |
mapad
|
public |
An aDNA aware short-read mapper
|
2024-12-14 |
skani
|
public |
skani is a fast and robust tool for calculating ANI between metagenome assembled genomes and contigs.
|
2024-12-14 |
mapping-iterative-assembler
|
public |
Consensus calling or `reference assisted assembly`, chiefly of ancient mitochondria.
|
2024-12-14 |
r-intego
|
public |
An unsupervised gene clustering algorithm based on the integration of external biological knowledge, such as Gene Ontology annotations, into expression data.
|
2024-12-14 |
htslib
|
public |
C library for high-throughput sequencing data formats.
|
2024-12-14 |
bbknn
|
public |
Batch balanced KNN
|
2024-12-14 |
ucsc-genepredtoprot
|
public |
create protein sequences by translating gene annotations
|
2024-12-14 |
wgsim
|
public |
No Summary
|
2024-12-14 |
deepchopper-cli
|
public |
A CLI for Genomic Language Model for Chimera Artifact Detection in Nanopore Direct RNA Sequencing.
|
2024-12-14 |
atac
|
public |
No Summary
|
2024-12-14 |
table2asn
|
public |
table2asn is a command-line program that creates sequence records for submission to GenBank - replaces tbl2asn.
|
2024-12-14 |
slow5tools
|
public |
Toolkit for S/bLOW5 format
|
2024-12-14 |
ucsc-bedpileups
|
public |
Find (exact) overlaps if any in bed input
|
2024-12-14 |
cansam
|
public |
C++ binding for SAM/BAM files
|
2024-12-14 |
perl-time-hires
|
public |
High resolution alarm, sleep, gettimeofday, interval timers
|
2024-12-14 |
bowtie
|
public |
An ultrafast memory-efficient short read aligner
|
2024-12-14 |
perl-digest-sha1
|
public |
Perl interface to the SHA-1 algorithm
|
2024-12-14 |
ucsc-fatrans
|
public |
Translate DNA .fa file to peptide
|
2024-12-14 |
contrafold
|
public |
CONditional TRAining for RNA Secondary Structure Prediction
|
2024-12-14 |
ucsc-checkagpandfa
|
public |
takes a .agp file and .fa file and ensures that they are in synch
|
2024-12-14 |
perl-unicode-normalize
|
public |
Unicode Normalization Forms
|
2024-12-14 |
ucsc-mafmefirst
|
public |
Move component to top if it is one of the named ones.
|
2024-12-14 |
meta-neuro
|
public |
Medial Tractography Analysis (MeTA)
|
2024-12-14 |
fasttree
|
public |
FastTree infers approximately-maximum-likelihood phylogenetic trees from alignments of nucleotide or protein sequences
|
2024-12-14 |
ucsc-chainantirepeat
|
public |
Get rid of chains that are primarily the results of repeats and degenerate DNA
|
2024-12-14 |
crass
|
public |
Crass (The CRISPR Assembler) is a program that searches through raw metagenomic reads for Clustered Regularly Interspersed Short Palindromic Repeats
|
2024-12-14 |
ucsc-hgsqldump
|
public |
Execute mysqldump using passwords from .hg.conf
|
2024-12-14 |
perl-unicode-map
|
public |
Map charsets from and to UTF-16 unicode
|
2024-12-14 |
ucsc-mafspecieslist
|
public |
Scan maf and output all species used in it.
|
2024-12-14 |
ucsc-pslrc
|
public |
reverse-complement psl
|
2024-12-14 |
meraculous
|
public |
Meraculous is a whole genome assembler for Next Generation Sequencing data, geared for large genomes. It's hybrid k-mer/read-based approach capitalizes on the high accuracy of Illumina sequence by eschewing an explicit error correction step which we argue to be redundant with the assembly process. Meraculous achieves high performance with large datasets by utilizing lightweight data structures and multi-threaded parallelization, allowing to assemble human-sized genomes on a high-cpu cluster in under a day. The process pipeline implements a highly transparent and portable model of job control and monitoring where different assembly stages can be executed and re-executed separately or in unison on a wide variety of architectures.
|
2024-12-14 |
hisat2
|
public |
Graph-based alignment of next generation sequencing reads to a population of genomes.
|
2024-12-14 |
ucsc-bedgraphpack
|
public |
Pack together adjacent records representing same value.
|
2024-12-14 |
perl-convert-binary-c
|
public |
Binary Data Conversion using C Types
|
2024-12-14 |
sketchy
|
public |
Real-time lineage hashing and genotyping of bacterial pathogens
|
2024-12-14 |
parafly
|
public |
Given a file containing a list of unix commands, multithreading is used to process the commands in parallel on a single server. Success/failure is captured, and failed commands are retained and reported.
|
2024-12-14 |
phyml
|
public |
Phylogenetic estimation using (Maximum) Likelihood
|
2024-12-14 |
ucsc-maftoaxt
|
public |
Convert from maf to axt format
|
2024-12-14 |
irfinder
|
public |
Intron Retention Finder
|
2024-12-14 |
ucsc-genepredtobed
|
public |
Convert from genePred to bed format. Does not yet handle genePredExt
|
2024-12-14 |
tntblast
|
public |
Searching DNA/RNA sequence databases with PCR and/or probe queries
|
2024-12-14 |
ucsc-chaincleaner
|
public |
Remove chain-breaking alignments from chains that break nested chains.
|
2024-12-14 |