×

A set of scikit-learn-style transformers for encoding categorical variables into numeric with different techniques. While ordinal, one-hot, and hashing encoders have similar equivalents in the existing scikit-learn version, the transformers in this library all share a few useful properties: - First-class support for pandas dataframes as an input (and optionally as output) - Can explicitly configure which columns in the data are encoded by name or index, or infer non-numeric columns regardless of input type - Can drop any columns with very low variance based on training set optionally - Portability: train a transformer on data, pickle it, reuse it later and get the same thing out. - Full compatibility with sklearn pipelines, input an array-like dataset like any other transformer

Uploaded Mon Mar 31 20:59:03 2025
md5 checksum f18f294c0588c9ffaf44c2da91841080
arch x86_64
build py311h06a4308_0
depends numpy >=1.11.1,<2.0a0, pandas >=0.20.1, patsy >=0.4.1, python >=3.11,<3.12.0a0, scikit-learn >=0.17.1, scipy >=0.17.0, statsmodels >=0.6.1
license BSD-3-Clause
license_family BSD
md5 f18f294c0588c9ffaf44c2da91841080
name category_encoders
platform linux
sha1 c73205a2701ad2fc57dedf1aa2c8a892c2dda1fb
sha256 c84c7e2f8169c504b2b5abbc542d30291c4352fbfe0e400b71d7f6295aba1163
size 81615
subdir linux-64
timestamp 1690384595380
version 1.3.0