r-voxr
|
public |
Tools for tree crown structure description based on T-LiDAR data voxelisation
|
2023-06-16 |
r-vortexrdata
|
public |
Contains selected data from two publications, Campbell et al. (2016) <DOI:10.1080/14486563.2015.1028486> and Pacioni et al. (2017) <DOI:10.1071/PC17002>. The data is provided both as raw outputs from the population viability analysis software 'Vortex' and packaged as R objects. The R package 'vortexR' uses the raw data provided here to illustrate its functionality of parsing raw 'Vortex' output into R objects.
|
2023-06-16 |
r-vdspcalibration
|
public |
Provides statistical methods for the design and analysis of a calibration study, which aims to calibrate measurements obtained with two different methods. The package includes sample size calculation, sample selection, regression analysis with error in measurements, and change-point regression. The method is described in Tian, Durazo-Arvizu, Myers, et al. (2014) <DOI:10.1002/sim.6235>.
|
2023-06-16 |
r-vcd
|
public |
Visualization techniques, data sets, summary and inference procedures aimed particularly at categorical data. Special emphasis is given to highly extensible grid graphics. The package was originally inspired by the book "Visualizing Categorical Data" by Michael Friendly and is now the main support package for a new book, "Discrete Data Analysis with R" by Michael Friendly and David Meyer (2015).
|
2023-06-16 |
r-usgsstates2k
|
public |
A map of the USA from the United States Geological Survey (USGS). Irucka worked with this data set while a Cherokee Nation Technology Solutions (CNTS) USGS Contractor and/or USGS employee. It is replaced by 'states2k'.
|
2023-06-16 |
r-zendeskr
|
public |
Provides an R wrapper for the Zendesk API.
|
2023-06-16 |
r-xtal
|
public |
A tool set for crystallographers to design and analyze crystallization experiments, especially for the ribosome from Mycobacterium tuberculosis.
|
2023-06-16 |
r-xplorerr
|
public |
Tools for interactive data exploration built using 'shiny'. Includes apps for descriptive statistics, visualizing probability distributions, inferential statistics, linear regression, logistic regression and RFM analysis.
|
2023-06-16 |
r-wtss
|
public |
An R client that provides remote access to satellite image time series. The client allows Earth observation users to obtain time series from data sets available in a Web Time Series Server. The functions include: (a) listing the data sets available in the server; (b) describing the contents of a data set; (c) retrieving a time series based on spatial location and temporal filters.
|
2023-06-16 |
r-word.alignment
|
public |
For a given sentence-aligned parallel corpus, it aligns words for each sentence pair. It considers one-to-many and symmetrization alignments. Moreover, it evaluates the quality of word alignment based on this package and some other software. It also builds an automatic dictionary of two languages based on a given parallel corpus.
|
2023-06-16 |
r-wnnsel
|
public |
New tools for the imputation of missing values in high-dimensional data are introduced using the non-parametric nearest neighbor methods. It includes weighted nearest neighbor imputation methods that use specific distances for selected variables. It includes an automatic procedure of cross validation and does not require prespecified values of the tuning parameters. It can be used to impute missing values in high-dimensional data when the sample size is smaller than the number of predictors. For more information see Faisal and Tutz (2017) <doi:10.1515/sagmb-2015-0098>.
|
2023-06-16 |
r-wikipedir
|
public |
A wrapper for the MediaWiki API, aimed particularly at the Wikimedia 'production' wikis, such as Wikipedia. It can be used to retrieve page text, information about users or the history of pages, and elements of the category tree.
|
2023-06-16 |
r-weightedroc
|
public |
Fast computation of Receiver Operating Characteristic (ROC) curves and Area Under the Curve (AUC) for weighted binary classification problems (weights are example-specific cost values).
|
2023-06-16 |
r-waiter
|
public |
Full screen splash loading screens for 'Shiny'.
|
2023-06-16 |
r-vudc
|
public |
Contains functions for visualizing univariate data: ccdplot and qddplot.
|
2023-06-16 |
r-volleystat
|
public |
Volleyball match statistics of the German volleyball first division league (seasons 2013/2014 to 2018/2019). The data has been collected from the official volleyball first division homepage (<www.volleyball-bundesliga.de>) and contains information on teams, staff, sets, matches, and player-in-match statistics (extracted automatically from the official match reports).
|
2023-06-16 |
r-valaddin
|
public |
A set of basic tools to transform functions into functions with input validation checks, in a manner suitable for both programmatic and interactive use.
|
2023-06-16 |
r-unitizer
|
public |
Simplifies regression tests by comparing objects produced by test code with earlier versions of those same objects. If objects are unchanged the tests pass, otherwise execution stops with error details. If in interactive mode, tests can be reviewed through the provided interactive environment.
|
2023-06-16 |
r-unikn
|
public |
Define and use graphical elements of corporate design manuals in R. The 'unikn' package provides color functions (by defining dedicated colors and color palettes, and commands for changing, viewing, and using them) and styled text elements (e.g., for marking, underlining, or plotting colored titles). The pre-defined range of colors and text functions is based on the corporate design of the University of Konstanz <https://www.uni-konstanz.de/>, but can be adapted and extended for other institutions and purposes.
|
2023-06-16 |
r-unicode
|
public |
Data from Unicode 12.0.0 and related utilities.
|
2023-06-16 |
r-udderquarterinfectiondata
|
public |
The udder quarter infection data set contains infection times of individual cow udder quarters with Corynebacterium bovis (Laevens et al. 1997 <DOI:10.3168/jds.S0022-0302(97)76295-7>). The four udder quarters are clustered within a cow, and udder quarters are sampled only approximately monthly, generating interval-censored data. The data set contains both covariates that change within a cow (e.g., front and rear udder quarters) and covariates that change between cows (e.g., parity [the number of previous calvings]). The correlation between udder infection times within a cow is also of interest, because this is a measure of the infectivity of the agent causing the disease. Various models have been applied to address the problem of interdependence for right-censored event times. These models, as applied to this data set, can be found in the publications in the reference list.
|
2023-06-16 |
r-twosampletest.hd
|
public |
For high-dimensional data whose main feature is a large number, p, of variables but a small sample size, the null hypothesis that the marginal distributions of p variables are the same for two groups is tested. We propose a test statistic motivated by the simple idea of comparing, for each of the p variables, the empirical characteristic functions computed from the two samples. If one rejects this global null hypothesis of no differences in distributions between the two groups, a set of permutation p-values is reported to identify which variables are not equally distributed in both groups.
|
2023-06-16 |
r-tuple
|
public |
Functions to find all matches or non-matches, orphans, and duplicate or other replicated elements.
|
2023-06-16 |
r-tuckerr.mmgg
|
public |
Performs Three-Mode Principal Components Analysis, which carries out Tucker Models.
|
2023-06-16 |
r-tsdisagg2
|
public |
Disaggregates low frequency time series data to higher frequency series. Implements the following methods for temporal disaggregation: Boot, Feibes and Lisman (1967) <DOI:10.2307/2985238>, Chow and Lin (1971) <DOI:10.2307/1928739>, Fernandez (1981) <DOI:10.2307/1924371> and Litterman (1983) <DOI:10.2307/1391858>.
|
2023-06-16 |
r-trueskill
|
public |
An implementation of the TrueSkill algorithm (Herbrich, R., Minka, T. and Graepel, T.) in R; a Bayesian skill rating system with inference by approximate message passing on a factor graph. Used by Xbox to rank gamers and identify appropriate matches; see http://research.microsoft.com/en-us/projects/trueskill/default.aspx. The current version allows for one player per team. Will update as time permits. Requires R version 3.0 as it is written with Reference Classes. URL: https://github.com/bhoung/trueskill-in-r. Acknowledgements to Doug Zongker and Heungsub Lee for their Python implementations of the algorithm and for the liberal reuse of Doug's code comments (@dougz and @sublee on GitHub).
|
2023-06-16 |
r-transcriber
|
public |
Transcribes audio to text with the HP IDOL API. Includes functions to upload files, retrieve transcriptions, and monitor jobs.
|
2023-06-16 |
r-tmpm
|
public |
Trauma Mortality prediction for ICD-9, ICD-10, and AIS lexicons in long or wide format based on Dr. Alan Cook's tmpm mortality model.
|
2023-06-16 |
r-titanic
|
public |
This data set provides information on the fate of passengers on the fatal maiden voyage of the ocean liner "Titanic", summarized according to economic status (class), sex, age and survival. Whereas the base R Titanic data found by calling data("Titanic") is an array resulting from cross-tabulating 2201 observations, these data sets are the individual non-aggregated observations, formatted in a machine learning context with a training sample, a testing sample, and two additional data sets that can be used for deeper machine learning analysis. These are also the data sets downloaded from the Kaggle competition and thus lower the barrier to entry for users new to R or machine learning.
|
2023-06-16 |
r-tinytest
|
public |
Provides a lightweight (zero-dependency) and easy to use unit testing framework. Main features: install tests with the package. Test results are treated as data that can be stored and manipulated. Test files are R scripts interspersed with test commands, that can be programmed over. Fully automated build-install-test sequence for packages. Skip tests when not run locally (e.g. on CRAN). Flexible and configurable output printing. Compare computed output with output stored with the package. Run tests in parallel. Extensible by other packages. Report side effects.
|
2023-06-16 |
r-thregi
|
public |
Fit a threshold regression model for Interval Censored Data based on the first-hitting-time of a boundary by the sample path of a Wiener diffusion process. The threshold regression methodology is well suited to applications involving survival and time-to-event data.
|
2023-06-16 |
r-threegroups
|
public |
Implements the Maximum Likelihood estimator for baseline, placebo, and treatment groups (three-group) experiments with non-compliance proposed by Gerber, Green, Kaplan, and Kern (2010).
|
2023-06-16 |
r-texreg
|
public |
Converts coefficients, standard errors, significance stars, and goodness-of-fit statistics of statistical models into LaTeX tables or HTML tables/MS Word documents or to nicely formatted screen output for the R console for easy model comparison. A list of several models can be combined in a single table. The output is highly customizable. New model types can be easily implemented.
|
2023-06-16 |
r-tcgaretriever
|
public |
The Cancer Genome Atlas (TCGA) is a program aimed at improving our understanding of cancer biology. Several TCGA datasets are available online. 'TCGAretriever' helps access and download TCGA data hosted on 'cBioPortal' via its Web Interface (see <http://www.cbioportal.org/web_api.jsp> for more information). Features of 'TCGAretriever' include: 1) it is very simple to use (get all the TCGA data you need with a few lines of code); 2) performance (smooth and reliable data download via 'httr'); 3) it is tailored for downloading large volumes of data.
|
2023-06-16 |
r-suntersampling
|
public |
Functions for drawing samples according to Sunter's sampling design, and for computing first and second order inclusion probabilities
|
2023-06-16 |
r-subsamp
|
public |
The subsample winner algorithm (SWA) for regression with large-p data (X, Y) selects the important variables (or features) among the p features X in explaining the response Y. The SWA first uses a base procedure, here a linear regression, on each of the subsamples randomly drawn from the p variables, and then computes the scores of all features, i.e., the p variables, according to the performance of these features collected in each of the subsample analyses. It then obtains the 'semifinalists' among the features based on the resulting scores and determines the 'finalists', i.e., the important features, from the 'semifinalists'. Fan, Sun and Qiao (2017) <http://sr2c.case.edu/swa-reg/>.
|
2023-06-16 |
r-subgroup.discovery
|
public |
Developed to assist in discovering interesting subgroups in high-dimensional data. The PRIM implementation is based on the 1998 paper "Bump hunting in high-dimensional data" by Jerome H. Friedman and Nicholas I. Fisher <doi:10.1023/A:1008894516817>. PRIM involves finding a set of "rules" which combined imply unusually large (or small) values of some other target variable. Specifically, one tries to find a set of subregions in which the target variable is substantially larger than the overall mean. The objective of bump hunting in general is to find regions in the input (attribute/feature) space with relatively high (low) values for the target variable. The regions are described by simple rules of the type if: condition-1 and ... and condition-n then: estimated target value. Given the data (or a subset of the data), the goal is to produce a box B within which the target mean is as large as possible. There are many problems where finding such regions is of considerable practical interest. Often these are problems where a decision maker can in a sense choose or select the values of the input variables so as to optimize the value of the target variable. In bump hunting it is customary to follow a so-called covering strategy. This means that the same box construction (rule induction) algorithm is applied sequentially to subsets of the data.
|
2023-06-16 |
r-stringb
|
public |
Base R already ships with string handling capabilities 'out-of-the-box' but lacks streamlined function names and workflow. The 'stringi' ('stringr') package on the other hand has well named functions, extensive Unicode support and allows for a streamlined workflow. However, it adds dependencies, and regular expression interpretation might differ between base R functions and 'stringi' functions. This package aims at providing a solution to the use case of unwanted dependencies on the one hand but the need for streamlined text processing on the other. The package's functions are solely based on wrapping base R functions into 'stringr'/'stringi' like function names. Along the way it adds one or two extra functions and last but not least provides all functions as generics, therefore allowing for adding methods for other text structures besides plain character vectors.
|
2023-06-16 |
r-stem
|
public |
Estimation of the parameters of a spatio-temporal model using the EM algorithm, estimation of the parameter standard errors using a spatio-temporal parametric bootstrap, spatial mapping.
|
2023-06-16 |
r-stackoverflow
|
public |
Helper functions collected from StackOverflow.com, a question and answer site for professional and enthusiast programmers.
|
2023-06-16 |
r-stabs
|
public |
Resampling procedures to assess the stability of selected variables with additional finite sample error control for high-dimensional variable selection procedures such as Lasso or boosting. Both standard stability selection (Meinshausen & Buhlmann, 2010, <doi:10.1111/j.1467-9868.2010.00740.x>) and complementary pairs stability selection with improved error bounds (Shah & Samworth, 2013, <doi:10.1111/j.1467-9868.2011.01034.x>) are implemented. The package can be combined with arbitrary user specified variable selection approaches.
|
2023-06-16 |
r-stabm
|
public |
An implementation of many measures for the assessment of the stability of feature selection. Both simple measures and measures which take into account the similarities between features are available, see Bommert et al. (2017) <doi:10.1155/2017/7907163>.
|
2023-06-16 |
r-sssimple
|
public |
Simulate, solve state space models
|
2023-06-16 |
r-unitcircle
|
public |
The uc.check() function checks whether the roots of a given polynomial lie outside the unit circle. You can also easily draw a unit circle.
|
2023-06-16 |
r-uiucthemes
|
public |
A set of custom 'R' 'Markdown' templates for documents and presentations with the University of Illinois at Urbana-Champaign (UIUC) color scheme and identity standards.
|
2023-06-16 |
r-udapi
|
public |
A client for the Urban Dictionary <http://www.urbandictionary.com/> API.
|
2023-06-16 |
r-twoway
|
public |
Carries out analyses of two-way tables with one observation per cell, together with graphical displays for an additive fit and a diagnostic plot for removable 'non-additivity' via a power transformation of the response. It implements Tukey's Exploratory Data Analysis methods, including a 1-degree-of-freedom test for row*column 'non-additivity', linear in the row and column effects.
|
2023-06-16 |
r-ttmoment
|
public |
Computes the first two moments of the truncated multivariate t (TMVT) distribution under double truncation. Applies the slice sampling algorithm to generate random variates from the TMVT distribution.
|
2023-06-16 |
r-tsbox
|
public |
Time series toolkit with identical behavior for all time series classes: 'ts','xts', 'data.frame', 'data.table', 'tibble', 'zoo', 'timeSeries', 'tsibble', 'tis' or 'irts'. Also converts reliably between these classes.
|
2023-06-16 |
r-tosls
|
public |
Fit an Instrumental Variables Two Stage Least Squares model
|
2023-06-16 |