|
# Model documentation & parameters |
|
|
|
**Algorithm Version**: Which model version to use. |
|
|
|
**Primer SMILES**: The SMILES string for priming the generation (**has** to be provided). |
|
|
|
**Maximal sequence length**: The maximal number of SMILES tokens in the generated molecule. |
|
|
|
**Sampling uniquely**: Generate unique sample sequences if set to true. |
|
|
|
**Number of samples**: How many samples should be generated (between 1 and 50). |
|
|
|
|
|
# Model card -- REINVENT |
|
|
|
**Model Details**: *REINVENT* is a collection of tools for *de novo* drug design. Here, we showcase the SMILES-based generative model. For details, see [Blaschke et al. (2020); *J. Chem. Inf. Model.*](https://pubs.acs.org/doi/10.1021/acs.jcim.0c00915). |
|
|
|
**Developers**: Thomas Blaschke and colleagues from AstraZeneca. |
|
|
|
**Distributors**: Original authors' code integrated into GT4SD. |
|
|
|
**Model date**: 2020, see the [REINVENT 2.0 paper](https://pubs.acs.org/doi/pdf/10.1021/acs.jcim.0c00915) |
|
|
|
**Model version**: N.A. |
|
|
|
**Model type**: A sequence-based molecular generator from the REINVENT toolbox. |
|
|
|
**Information about training algorithms, parameters, fairness constraints or other applied approaches, and features**: |
|
The code for getting unique sample sequences, randomizing scaffolds, and generation of the dataset as well as the dataloader was taken from the original implementation of [Molecular Reinvent](https://github.com/MolecularAI/Reinvent). Our implementation does not include [BaseAction](https://github.com/MolecularAI/Reinvent/blob/982b26dd6cfeb8aa84b6d7e4a8c2a7edde2bad36/running_modes/lib_invent/rl_actions/sample_model.py#:~:text=class%20BaseAction(abc.ABC)%3A) as a parent class for the [ReinventBase](/gt4sd/algorithms/conditional_generation/reinvent/reinvent_core/core.py) where we have added all the functions of [Molecular Reinvent](https://github.com/MolecularAI/Reinvent). |
|
|
|
|
|
**Paper or other resource for more information**: |
|
[REINVENT 2.0 -- Blaschke et al. (2020); *J. Chem. Inf. Model.*](https://pubs.acs.org/doi/10.1021/acs.jcim.0c00915). |
|
|
|
**License**: MIT |
|
|
|
**Where to send questions or comments about the model**: Open an issue on [GT4SD-REINVENT-repository](https://github.com/GT4SD/reinvent-models). |
|
|
|
**Intended Use. Use cases that were envisioned during development**: Chemical research, in particular drug discovery. |
|
|
|
**Primary intended uses/users**: Researchers and computational chemists using the model for model comparison or research exploration purposes. |
|
|
|
**Out-of-scope use cases**: Production-level inference, producing molecules with harmful properties. |
|
|
|
**Metrics**: N.A. |
|
|
|
**Datasets**: N.A. |
|
|
|
**Ethical Considerations**: Unclear, please consult with original authors in case of questions. |
|
|
|
**Caveats and Recommendations**: Unclear, please consult with original authors in case of questions. |
|
|
|
Model card prototype inspired by [Mitchell et al. (2019)](https://dl.acm.org/doi/abs/10.1145/3287560.3287596?casa_token=XD4eHiE2cRUAAAAA:NL11gMa1hGPOUKTAbtXnbVQBDBbjxwcjGECF_i-WC_3g1aBgU1Hbz_f2b4kI_m1in-w__1ztGeHnwHs) |
|
|
|
## Citation |
|
```bib |
|
@article{blaschke2020reinvent, |
|
title={REINVENT 2.0: an AI tool for de novo drug design}, |
|
author={Blaschke, Thomas and Ar{\'u}s-Pous, Josep and Chen, Hongming and Margreitter, Christian and Tyrchan, Christian and Engkvist, Ola and Papadopoulos, Kostas and Patronov, Atanas}, |
|
journal={Journal of chemical information and modeling}, |
|
volume={60}, |
|
number={12}, |
|
pages={5918--5922}, |
|
year={2020}, |
|
publisher={ACS Publications} |
|
} |
|
``` |