r-kriging
|
public |
Simple and highly optimized ordinary kriging algorithm to plot geographical data
|
2025-09-22 |
llama-cloud-services
|
public |
Tailored SDK clients for LlamaCloud services.
|
2025-09-22 |
mne-lsl
|
public |
Real-time framework integrated with MNE-Python for online neuroscience research through LSL-compatible devices.
|
2025-09-22 |
r-densityclust
|
public |
An improved implementation (based on k-nearest neighbors) of the density peak clustering algorithm, originally described by Alex Rodriguez and Alessandro Laio (Science, 2014 vol. 344). It can handle large datasets (> 100, 000 samples) very efficiently. It was initially implemented by Thomas Lin Pedersen, with inputs from Sean Hughes and later improved by Xiaojie Qiu to handle large datasets with kNNs.
|
2025-09-22 |
r-udpipe
|
public |
This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at <https://universaldependencies.org/format.html>. The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at <doi:10.18653/v1/K17-3009>. The toolkit also contains functionalities for commonly used data manipulations on texts which are enriched with the output of the parser. Namely functionalities and algorithms for collocations, token co-occurrence, document term matrix handling, term frequency inverse document frequency calculations, information retrieval metrics (Okapi BM25), handling of multi-word expressions, keyword detection (Rapid Automatic Keyword Extraction, noun phrase extraction, syntactical patterns) sentiment scoring and semantic similarity analysis.
|
2025-09-22 |
r-lokern
|
public |
Kernel regression smoothing with adaptive local or global plug-in bandwidth selection.
|
2025-09-22 |
r-anomalydetection
|
public |
Implements procedures developed by Gutierrez et al. (2017, <https://journal.r-project.org/archive/2017/RJ-2017-039/index.html>) to aid in detecting network log anomalies. By combining various multivariate analytic approaches relevant to network anomaly detection, it provides cyber analysts efficient means to detect suspected anomalies requiring further evaluation.
|
2025-09-22 |
ome-zarr-models
|
public |
A minimal Python package for reading OME-Zarr (meta)data
|
2025-09-22 |
curated-tokenizers
|
public |
Lightweight piece tokenization library
|
2025-09-22 |
hyp3_sdk
|
public |
A python wrapper around the HyP3 API
|
2025-09-22 |
r-topicmodels
|
public |
Provides an interface to the C code for Latent Dirichlet Allocation (LDA) models and Correlated Topics Models (CTM) by David M. Blei and co-authors and the C++ code for fitting LDA models using Gibbs sampling by Xuan-Hieu Phan and co-authors.
|
2025-09-22 |
r-turfr
|
public |
Package for analyzing TURF (Total Unduplicated Reach and Frequency) data in R. No looping in TURF algorithm results in fast processing times. Allows for individual-level weights, depth specification, and user-truncated combination set(s). Allows user to substitute Monte Carlo simulated combination set(s) after set(s) exceed a user-specified limit.
|
2025-09-22 |
r-treeheatr
|
public |
Creates interpretable decision tree visualizations with the data represented as a heatmap at the tree's leaf nodes. 'treeheatr' utilizes the customizable 'ggparty' package for drawing decision trees.
|
2025-09-22 |
r-coranking
|
public |
Calculates the co-ranking matrix to assess the quality of a dimensionality reduction.
|
2025-09-22 |
r-optm
|
public |
The popular population genetic software 'Treemix' by 'Pickrell and Pritchard' (2012) <DOI:10.1371/journal.pgen.1002967> estimates the number of migration edges on a population tree. However, it can be difficult to determine the number of migration edges to include. Previously, it was customary to stop adding migration edges when 99.8% of variation in the data was explained, but 'OptM' automates this process using an ad hoc statistic based on the second-order rate of change in the log likelihood. 'OptM' also has added functionality for various threshold modeling to compare with the ad hoc statistic.
|
2025-09-22 |
r-mvn
|
public |
Performs multivariate normality tests and graphical approaches and implements multivariate outlier detection and univariate normality of marginal distributions through plots and tests, and performs multivariate Box-Cox transformation (Korkmaz et al, (2014), <https://journal.r-project.org/archive/2014-2/korkmaz-goksuluk-zararsiz.pdf>).
|
2025-09-22 |
cargo-c
|
public |
build and install C-compatible libraries
|
2025-09-22 |
r-msos
|
public |
Multivariate Analysis methods and data sets used in John Marden's book Multivariate Statistics: Old School (2015) <ISBN:978-1456538835>. This also serves as a companion package for the STAT 571: Multivariate Analysis course offered by the Department of Statistics at the University of Illinois at Urbana-Champaign ('UIUC').
|
2025-09-22 |
r-gfd
|
public |
Implemented are the Wald-type statistic, a permuted version thereof as well as the ANOVA-type statistic for general factorial designs, even with non-normal error terms and/or heteroscedastic variances, for crossed designs with an arbitrary number of factors and nested designs with up to three factors. Friedrich et al. (2017) <doi:10.18637/jss.v079.c01>.
|
2025-09-22 |
r-emayili
|
public |
A light, simple tool for sending emails with minimal dependencies.
|
2025-09-22 |
r-isa2
|
public |
The ISA is a biclustering algorithm that finds modules in an input matrix. A module or bicluster is a block of the reordered input matrix.
|
2025-09-22 |
r-shinyheatmaply
|
public |
Access functionality of the 'heatmaply' package through 'Shiny UI'.
|
2025-09-22 |
r-finch
|
public |
Parse and create Darwin Core (<http://rs.tdwg.org/dwc/>) Simple and Archives. Functionality includes reading and parsing all the files in a Darwin Core Archive, including the datasets and metadata; read and parse simple Darwin Core files; and validation of Darwin Core Archives.
|
2025-09-22 |
easycrystallography
|
public |
Crystallography in EasyScience
|
2025-09-22 |
mapflow
|
public |
A Python library to export xarray.DataArray as video files
|
2025-09-22 |