HNet documentation!

|Python |Python Version Sphinx documentation Stars lines of code Downloads per month Downloads in total License Github Forks Open Issues Project Status Substack Blog Colab example Cite repo-size donate

Real-world data sets often contain measurements with both continues and categorical values for the same sample. Despite the availability of many libraries, data sets with mixed data types require intensive pre-processing steps, and it remains a challenge to describe the relationships of one variable on another. The data understanding part is crucial but without making any assumptions on the model form, the search space is super-exponential in the number of variables and therefore not a common practice. We propose graphical hypergeometric networks (HNet), a method where associations across variables are tested for significance by statistical inference. HNet learns the Association from datasets with mixed datatypes and with unknown function. Input datasets can range from generic dataframes to nested data structures with lists, missing values and enumerations. The aim is to determine a network with significant associations that can shed light on the complex relationships across variables. HNet can be used for all kind of datasets that contain features such as categorical, boolean, and/or continuous values.

Use HNET if your goal is:

    1. Explore the complex associations between your variables.

    1. Explain your clusters by enrichment of the meta-data.

    1. Transform your feature space into network graph and/or dissimilarity matrix that can be used for further analysis.


Note

Your ❤️ is important to keep maintaining this package. You can support in various ways, have a look at the sponser page. Report bugs, issues and feature extensions at github page.

pip install hnet

Content

Installation

Indices and tables