README.md

# AnalysisPrep

Support data extraction for analysis and machine learning.

In order to extract data from several systems, I've combined some of the
tools that make this easier.

* Normalize and scale a data series
* One hot or label encode a data series
* Calculate frequencies and frequency percentages
* Summarize variables with basic statistics (mean, sum, min, max, median, standard deviation, etc.)
* Estimate the correlation between variables.
* Extract probabilistic values for Markov models and other probabilistic models.
* Save data into CSV files

For now, the Doctests are the best documentation for using these features.  I'll write up some better
demonstrations soon.

## TODO:

This is a work in progress. There are several things I'd still like to add.

* Demonstrate the core features
* Extraction via Ecto
* Binning functions
* Test and expand to support GenStage and Flow
* Time series analysis

## Installation

If [available in Hex](https://hex.pm/docs/publish), the package can be installed as:

  1. Add `analysis_prep` to your list of dependencies in `mix.exs`:

    ```elixir
    def deps do
      [{:analysis_prep, "~> 0.1.0"}]
    end
    ```

  2. Ensure `analysis_prep` is started before your application:

    ```elixir
    def application do
      [applications: [:analysis_prep]]
    end
    ```