.. _model-docs-page:

Model
=====

This document describes the classes in ``mtenn.model`` and their substituent parts, and the format of their inputs and outputs.

Model Blocks
------------

Each class in ``mtenn.model`` is comprised of at least one of the following blocks:

* :ref:`representation-block`
* :ref:`strategy-block`
* :ref:`readout-block`
* :ref:`combination-block`

.. _representation-block:

Representation
^^^^^^^^^^^^^^

The ``Representation`` block (``mtenn.representation``) is responsible for taking an input structure and learning an n-dimensional vector embedding.
In practice, this block is a thin wrapper around an existing model architecture, potentially with some ad hoc manipulation done to ensure the output is a vector rather than a single scalar value.

For more information on the models that are currently implemented in ``mtenn``, see the :ref:`current-models` section.

For information on adding new models into the ``mtenn`` framework, see the guide on :ref:`adding a new installable model <new-installable-model-guide>`.


.. _strategy-block:

Strategy
^^^^^^^^

The ``Strategy`` block (``mtenn.strategy``) is responsible for taking any number of vectors, each output from a ``Representation`` block, and combining them into a :math:`\Delta g_{\mathrm{bind}}` prediction in kT units.

Currently, the following ``Strategy`` blocks are implemented in ``mtenn``:

* :py:class:`mtenn.strategy.DeltaStrategy`
* :py:class:`mtenn.strategy.ConcatStrategy`
* :py:class:`mtenn.strategy.ComplexOnlyStrategy`

.. _readout-block:

Readout
^^^^^^^

The ``Readout`` block (``mtenn.readout``) is responsible for converting the :math:`\Delta g_{\mathrm{bind}}` prediction output from a ``Strategy`` block from kT units into any arbitrary other unit.
Importantly, in contrast to the two previous blocks, this block doesn't have any learned parameters.
This increases model portability, as it allows the same model to be trained on multiple different data types by only swapping out this last layer.

We currently have implementations for :py:class:`pIC50 <mtenn.readout.PIC50Readout>` and :py:class:`pKi <mtenn.readout.PKiReadout>`.

.. _combination-block:

Combination
^^^^^^^^^^^

The ``Combination`` block (``mtenn.combination``) is responsible for combining multiple model predictions for the same compound into a single prediction.
The internal workings of these blocks are a bit complex, and a more in-depth explanation is given in the :doc:`combination` page.

Currently, the following ``Combination`` blocks are implemented in ``mtenn``:

* :py:class:`mtenn.combination.MeanCombination`
* :py:class:`mtenn.combination.MaxCombination`

.. _single-pose-model-docs:

Single-Pose Models
------------------

This section is a description of the ``mtenn.model.Model`` class (referred to as ``Model`` from here), which makes a prediction on a single input conformation.
The general data flow through a ``Model`` object is as depicted in the below diagram:

.. image:: /static/mtenn_model_diagram.png

In text form this is:

#. The protein-ligand complex structure is passed to the ``Model``
#. Internally, the ``Model`` breaks the structure into 3 sub-structures: the full complex, just the protein, and just the ligand
#. Each of these sub-structures is individually passed to the ``Representation`` block to generate a total of 3 vector representations
#. All 3 representations are passed to the ``Strategy`` block, where they are combined into a :math:`\Delta g_{\mathrm{bind}}` prediction in implicit kT units
#. (optional) The :math:`\Delta g_{\mathrm{bind}}` prediction is passed to the ``Readout`` block, where it is converted into whatever the final units are

.. _multi-pose-model-docs:

Multi-Pose Models
-----------------

This section is a description of the ``mtenn.model.GroupedModel`` class (``GroupedModel`` from here), which makes a prediction on multiple input conformations.
The general data flow through a ``GroupedModel`` object is as depicted in the below diagram:

.. image:: /static/mtenn_grouped_model_diagram.png

In text form this is:

#. Each input conformation is passed through the same ``Model`` object to get a prediction for each individual conformation
#. All predictions are passed through a ``Combination`` block to get an overall :math:`\Delta g_{\mathrm{bind}}` prediction for the group of input poses
#. (optional) The overall :math:`\Delta g_{\mathrm{bind}}` prediction is passed to the ``Readout`` block, where it is converted into whatever the final units are

.. _ligand-only-model-docs:

Ligand-Only Models
------------------

This section is a description of the ``mtenn.model.LigandOnlyModel`` class (``LigandOnlyModel`` from here), which makes a prediction based only on a ligand representation.
This class is mainly useful for 2D baseline models to compare the structure-based models against (eg ligand-only GNNs, fingerprint-based models, etc).
The general data flow through a ``LigandOnlyModel`` object is the same as for a ``Model``, but the ``Representation`` block is responsible for generating the energy prediction from the input, and the ``Strategy`` block is simply the identity function.

.. _current-models:

Currently Implemented Models
----------------------------

* :py:mod:`Graph Attention <mtenn.conversion_utils.gat>`
* :py:mod:`SchNet <mtenn.conversion_utils.schnet>`
* :py:mod:`e3nn <mtenn.conversion_utils.e3nn>`
* :py:mod:`VisNet <mtenn.conversion_utils.visnet>`

Data Model
----------

Input Data
^^^^^^^^^^

Currently, all of the single-pose models in ``mtenn`` (``Model`` and ``LigandOnlyModel``) expect a ``dict`` object to be passed as their input.
The ``GroupedModel`` expects a list of these ``dicts``, each one corresponding to a different input pose.
What keys each model expects in the ``dict`` is left to the implementation of that model in the ``conversion_utils`` module.
For more details on the data expected by each model, check that model's docs page.

Output Data
^^^^^^^^^^^

To unify the outputs of all model types, all 3 models (``Model``, ``GroupedModel``, and ``LigandOnlyModel``) return two values: a scalar value that represents the model's final prediction, and a list of values that represent the pre-``Readout`` predictions of each input pose.
In the case of the single-pose models, this list will have exactly one element.
In the case of a multi-pose model, this list will have one element for each element in the list of input poses.