Lightwood works by generating code for `Predictor` objects out of structured data (e.g. a data frame) and a problem definition. The simplest possible problem definition is just the name of the column to predict.
The data can be anything. It can contain numbers, dates, categories, text (in any language, though English is currently the primary focus), quantities, arrays, matrices, images, audio, or video. The last three are referenced as file-system paths or URLs, since storing them as binary data can be cumbersome.
The generated `Predictor` object can be fitted by calling its `learn` method, or through a lower-level step-by-step API. It can then make predictions on similar data (the same columns, except for the target) by calling its `predict` method. That's the gist of it.
There's an intermediate representation called `JsonAI` that gets turned into the final Python code. This provides an easy way to edit the `Predictor` being generated from the original data and problem specifications. It also enables prototyping custom code without modifying the library itself, or even having a "development" version of the library installed.
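To make this concrete, a `JsonAI`-style specification might look roughly like the dictionary below. The field names, encoder names, mixer names, and ensemble name here are illustrative assumptions for the sketch, not Lightwood's exact schema:

```python
# Hypothetical sketch of a JsonAI-style spec; all names below are illustrative
# assumptions, not Lightwood's actual schema.
json_ai_sketch = {
    "problem_definition": {"target": "price"},
    "encoders": {
        "price": "NumericEncoder",        # assumed encoder name
        "description": "TextEncoder",     # assumed encoder name
    },
    "model": {
        "mixers": ["Neural", "LightGBM"],  # assumed mixer names
        "ensemble": "BestOf",              # assumed ensemble name
    },
}

# Editing the spec before code generation is how custom behavior would be
# prototyped, e.g. swapping the encoder used for one column:
json_ai_sketch["encoders"]["description"] = "MyCustomEncoder"
```

The point is that the whole pipeline is data: changing one entry changes the generated code, without touching the library.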
Pipeline
------------
Lightwood abstracts the ML pipeline into 3 core steps:
1. Pre-processing and data cleaning
2. Feature engineering
3. Model building and training
.. Figure: Lightwood "under-the-hood" (centered diagram; image path not recovered)
By default, each of them entails:
i) Pre-processing and cleaning
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
For each column in your dataset, Lightwood will infer the likely data type (numeric, categorical, etc.) via a brief statistical analysis. From this, it will generate a `JsonAI` object.
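A toy version of such type inference might look like the following. This is a naive sketch with made-up thresholds, not Lightwood's actual analysis:

```python
def infer_type(values):
    """Naively infer a column's type from a sample of its values (sketch only;
    Lightwood's real analysis is more thorough)."""
    sample = [v for v in values if v is not None]
    if sample and all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in sample
    ):
        return "numeric"
    # Few distinct values relative to the sample size suggests a categorical
    # column; the 10% threshold here is an arbitrary illustrative choice.
    if len(set(sample)) <= max(1, len(sample) // 10):
        return "categorical"
    return "text"
```

For example, `infer_type([1, 2, 3.5])` yields `"numeric"`, while a column of mostly-unique strings falls through to `"text"`.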
Lightwood will perform a brief pre-processing pass to clean each column according to its identified data type (e.g. dates represented as a mix of string formats and timestamp floats are converted to datetime objects). From there, it will split the data into train/dev/test sets.
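A minimal sketch of those two pieces, assuming a date column with mixed representations and an 80/10/10 split (the date formats and split ratios are illustrative choices, not Lightwood's defaults):

```python
from datetime import datetime, timezone
import random

def clean_date(value):
    """Coerce a mixed-format date value to a datetime, or None if unparseable."""
    if isinstance(value, (int, float)):  # timestamp float
        return datetime.fromtimestamp(value, tz=timezone.utc)
    for fmt in ("%Y-%m-%d", "%d/%m/%Y"):  # assumed known string formats
        try:
            return datetime.strptime(value, fmt).replace(tzinfo=timezone.utc)
        except ValueError:
            pass
    return None  # unparseable -> treated as missing

def split(rows, pct=(0.8, 0.1, 0.1), seed=0):
    """Shuffle rows and cut them into train/dev/test partitions."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    a = int(len(rows) * pct[0])
    b = a + int(len(rows) * pct[1])
    return rows[:a], rows[a:b], rows[b:]
```

In the real pipeline these roles are played by the `cleaner` and `splitter` described below, which are per-column-type and configurable.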
The `cleaner` and `splitter` objects respectively refer to the pre-processing and the data splitting functions.
ii) Feature Engineering
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Data can be converted into features via "encoders". Encoders represent the rules for transforming pre-processed data into a numerical representation that a model can use.
Encoders can be **rule-based** or **learned**. A rule-based encoder transforms data per a specific set of instructions (e.g. normalizing numerical data), whereas a learned encoder produces a representation of the data after training (e.g. the "[CLS]" token in a language model).
Encoders are assigned to each column of data based on the data type, and depending on the type there can be inter-column dependencies (e.g. time series). Users can override this assignment either at the column-based level or at the datatype-based level. Encoders inherit from the `BaseEncoder` class.
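A rule-based encoder in this style could be sketched as follows. The `BaseEncoder` here is a minimal stand-in written for illustration; Lightwood's actual base class has a richer interface:

```python
class BaseEncoder:
    """Minimal stand-in for an encoder interface: prepare(), then encode().
    Illustrative only; not Lightwood's actual BaseEncoder."""
    def prepare(self, column_data):
        raise NotImplementedError
    def encode(self, column_data):
        raise NotImplementedError

class MinMaxNumericEncoder(BaseEncoder):
    """Rule-based encoder: min-max normalization of a numeric column."""
    def prepare(self, column_data):
        # "Training" a rule-based encoder just records column statistics.
        self.lo, self.hi = min(column_data), max(column_data)
    def encode(self, column_data):
        span = (self.hi - self.lo) or 1.0  # guard against constant columns
        return [(x - self.lo) / span for x in column_data]
```

A learned encoder would have the same `prepare`/`encode` shape, but `prepare` would train a model instead of recording statistics.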
iii) Model Building and Training
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
We call a predictive model that takes *encoded* feature data as input and outputs a prediction for the target of interest a `mixer` model. Users can either use Lightwood's default mixers or create their own approaches by inheriting from the `BaseMixer` class.
We predominantly use PyTorch-based approaches, but can support other models.
Multiple mixers can be trained for any given `Predictor`. After mixer training, an ensemble is created (and potentially trained) to decide which mixers to use and how to combine their predictions.
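One simple ensembling strategy along these lines is to keep only the mixer whose predictions score best on the dev set. This is a generic sketch of that idea, not necessarily how Lightwood's ensembles are implemented:

```python
def best_of(mixers, dev_x, dev_y, loss):
    """Return the mixer whose predictions minimize total loss on the dev set.
    A 'keep the single best mixer' ensemble, sketched generically."""
    def total_loss(mixer):
        return sum(loss(mixer(x), y) for x, y in zip(dev_x, dev_y))
    return min(mixers, key=total_loss)

# Two toy "mixers": plain callables standing in for trained models.
double = lambda x: 2 * x
triple = lambda x: 3 * x
```

Richer ensembles (e.g. weighted averaging of several mixers) would slot into the same place in the pipeline: they consume per-mixer predictions and dev-set scores.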
Finally, a "model analysis" step looks at the whole ensemble and extracts some stats about it, in addition to building confidence models that allow us to output a confidence score and prediction interval for each prediction. We also use this step to generate some explanations about model behavior.
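One standard way to obtain such intervals is split-conformal prediction: take the absolute residuals on a held-out calibration set and use their (1 - alpha) quantile as a symmetric bound around each prediction. A sketch of that idea (not necessarily the analysis Lightwood ships):

```python
def conformal_interval(calib_residuals, prediction, alpha=0.1):
    """Symmetric prediction interval from held-out absolute residuals,
    in the split-conformal style (illustrative sketch)."""
    qs = sorted(abs(r) for r in calib_residuals)
    # Index of the conformal quantile, clamped to the largest residual.
    k = min(len(qs) - 1, int((1 - alpha) * (len(qs) + 1)))
    q = qs[k]
    return prediction - q, prediction + q
```

The appeal of this family of methods is that the interval's coverage guarantee depends only on the calibration split, not on which mixers ended up in the ensemble.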
Predicting is very similar: data is cleaned and encoded, then mixers make their predictions and the results are ensembled. Finally, explainer modules determine things like confidence, prediction bounds, and column importances.
Strengths and drawbacks
------------------------
The main benefit of Lightwood's architecture is that it is very easy to extend. Full understanding (or even any understanding) of the pipeline is not required to improve a specific component. Users can easily integrate their custom code with minimal hassle, even if PRs are not accepted, while still pulling everything else from upstream. This works well with the open-source nature of the project.
The second advantage is that the pipeline is relatively easy to parallelize, since most tasks are done per feature. The bits that operate on all the data (mixer training and model analysis) are made up of multiple blocks with similar APIs, which can themselves run in parallel.
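The per-feature structure parallelizes naturally; for example, each column can be encoded as an independent task. A generic sketch using the standard library (not Lightwood's actual scheduler):

```python
from concurrent.futures import ThreadPoolExecutor

def encode_columns(columns, encode_fn):
    """Encode each column in parallel; per-column tasks share no state,
    so they can be dispatched independently."""
    with ThreadPoolExecutor() as pool:
        futures = {
            name: pool.submit(encode_fn, data) for name, data in columns.items()
        }
        return {name: f.result() for name, f in futures.items()}
```

The same pattern applies to the analysis blocks: anything with a uniform per-unit API can be fanned out to a pool.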
Finally, most of Lightwood is built on PyTorch, and PyTorch mixers and encoders are first-class citizens insofar as the data format makes them the easiest to work with. In that sense, performance on specialized hardware and continued compatibility are taken care of for us, which frees up time to work on other things.
The main drawback, however, is that the pipeline separation doesn't allow phases to exert much influence on each other or run jointly. This means you can't easily have mixer gradients propagate back through to train the encoders, nor analysis blocks that inspect the model and decide the data cleaning procedure should change. Granted, there's no hard limit on this, but any such implementation would be rather unwieldy in terms of code complexity.