GitHub - Augus1999/ChemBFN-WebUI: A web-based UI to host ChemBFN generative models and visualise the generated molecules. 🛠️pip install chembfn-webui

Web-based UI visualisation tool for ChemBFN method

Important

For the security concerning, it is not recommended to use this application as a public service. When deploying on a local host as a shared application, it is better to install this application in a container or VM, to prevent this application from accessing the Internet, and to limit the premissions of read, create, and delete loacal files and directories.

1. Install

$ pip install -U chembfn_webui

2. Place model files

Place customised vocabulary files (plain text .txt format) under chembfn_webui/model/vocab.
Place base model files (pretrained model in .pt format) under chembfn_webui/model/base_model.
Group standalone model files (model.pt, optional mlp.pt) and configuration file (config.json) in one folder for each model and place these folders under chembfn_webui/model/standalone_model.
Group LoRA model files (lora.pt, optional mlp.pt) and configuration file (config.json) in one folder for each model and place these folders under chembfn_webui/model/lora.

For example,

├───chembfn_webui
    ├───bin
    ├───cache
    ├───lib
    └───model
        ├───base_model
        |   └───zinc15_190m.pt
        ├───lora
        │   └───csd_ees
        │       ├───lora.pt
        |       ├───mlp.pt
        |       └───config.json
        ├───standalone_model
        │   ├───guacamol
        │   |   ├───model.pt
        │   |   └───config.json
        │   └───qm9
        |        ├───model.pt
        |        ├───mlp.pt
        |        └───config.json
        └───vocab
            └───moses_selfies_vocab.txt

Note

The file config.json is automatically saved by CLI tool Madmol provided in bayesianflow-for-chem package. If you train models via Python API, you need to manually create that file for your models by filling in the tempate:
{
   "padding_index": 0,
   "start_index": 1,
   "end_index": 2,
   "padding_strategy": "static",
   "padding_length": PADDING_LENGTH,
   "label": [LABEL_NAME_I, LABEL_NAME_II, ...],
   "name": JOB_NAME
}
The configureation file for base models can be downloaded here.

If placed correctly, all these files can be seen in the "model explorer" tab.

You can use an external folder to host the models if it follows the same structure as chembfn_webui/model. See the next section for the method.

3. Launch the program

I. launch the web-UI

$ chembfn

II. launch the web in a public link

$ chembfn --public

III. use an external directory to hold the model files (Linux and MacOS)

$ CHEMBFN_WEBUI_MODEL_DIR={YOUR/MODEL/DIR} chembfn

IV. use an external directory to hold the model files (Windows)

# CMD ↓
> set CHEMBFN_WEBUI_MODEL_DIR={YOUR/MODEL/DIR} & chembfn

# PowerShell ↓
> $env:CHEMBFN_WEBUI_MODEL_DIR="{YOUR/MODEL/DIR}" & chembfn

V. use an external directory to hold the model files (Notebook, Google Colab)

import os
os.environ["CHEMBFN_WEBUI_MODEL_DIR"] = "{YOUR/MODEL/DIR}"
!chembfn --public

4. Write the prompt

Leave prompt blank for unconditional generation.
For standalone models, key in objective values in the format of [a,b,c,...] to pass the values to the model.
Key in <name:A> or <name:A>:[a,b,c,...] to select LoRA parameter and pass the objective values if necessary, where name is the LoRA model name and A is the LoRA scaling. You can easily select a LoRA model by clicking the model name in "LoRA models" tab as well.
You can stack several LoRA models together to form an ensemble model by prompt like <name1:A1>:[a1,b1,c1,...];<name2:A2>:[a2,b2,...];.... Note that here A1, A2, etc are contributions of each model to the ensemble.

5. Advanced control

Under "advanced control" tab

You can control semi-autoregressive behaviours by key in F for switching off SAR, T for switching on SAR, and prompt like F,F,T,... to individually control the SAR in an ensemble model.
You can add unwanted tokens, e.g., [Cu],p,[Si].
You can customise the result preprocessing function, e.g., the model output a reaction SMILES "CCI.C[O-]>>COCC" which couldn't be recognised by RDKit; you can pass lambda x: x.split(">>")[-1] to force the program only looking at the products.

6. Generate molecules

Click "RUN" then here you go! If error occured, please check your prompts and settings.

Where to obtain the models?

Pretrained models: https://huggingface.co/suenoomozawa/ChemBFN
ChemBFN source code: https://github.com/Augus1999/bayesian-flow-network-for-chemistry
ChemBFN document: https://augus1999.github.io/bayesian-flow-network-for-chemistry/
ChemBFN package: https://pypi.org/project/bayesianflow-for-chem/

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
.github		.github
chembfn_webui		chembfn_webui
example		example
image		image
test		test
LICENSE		LICENSE
README.md		README.md
other-requirements.txt		other-requirements.txt
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web-based UI visualisation tool for ChemBFN method

1. Install

2. Place model files

3. Launch the program

4. Write the prompt

5. Advanced control

6. Generate molecules

Where to obtain the models?

About

Uh oh!

Uh oh!

Languages

License

Augus1999/ChemBFN-WebUI

Folders and files

Latest commit

History

Repository files navigation

Web-based UI visualisation tool for ChemBFN method

1. Install

2. Place model files

3. Launch the program

4. Write the prompt

5. Advanced control

6. Generate molecules

Where to obtain the models?

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages