|
--- |
|
language: |
|
- en |
|
tags: |
|
- embeddings |
|
- multimodal |
|
- olfaction-vision-language |
|
- olfaction |
|
- olfactory |
|
- diffusion |
|
- scentience |
|
- neural-network |
|
- graph-neural-network |
|
- gnn |
|
- molecule |
|
- vision-language |
|
- vision |
|
- language |
|
- robotics |
|
- multimodal |
|
- smell |
|
license: mit |
|
datasets: |
|
- kordelfrance/olfaction-vision-language-dataset |
|
- detection-datasets/coco |
|
base_model: DEGNN-Unconstrained |
|
--- |
|
|
|
Diffusion Graph Neural Networks for Robust Olfactory Navigation in Robotics |
|
---- |
|
|
|
<div align="center"> |
|
|
|
**Olfaction β’ Vision β’ Language** |
|
|
|
|
|
[](#license) |
|
[](https://colab.research.google.com/drive/1z-ITTEfVtMMbfbN50u2AfQhzvuYkrRn7?usp=sharing) |
|
[](https://arxiv.org/abs/2506.00455) |
|
[](https://huggingface.co/spaces) |
|
|
|
</div> |
|
|
|
|
|
An open-sourced diffusion-based equivariant graph neural network (DEGNN) for olfaction-vision-language tasks. |
|
|
|
--- |
|
|
|
## Model Description |
|
|
|
Navigation by scent is a capability in robotic systems that is rising in demand. |
|
However, current methods often suffer from ambiguities, particularly when robots misattribute odours to incorrect objects due to limitations in olfactory datasets and sensor resolutions. |
|
To address challenges in olfactory navigation, we introduce a novel machine learning method using diffusion-based molecular gen- |
|
eration that can be used by itself or with automated olfactory |
|
dataset construction pipelines. |
|
Our models, diffusion-based equivariant graph neural networks (`DEGNN` for short), leverage the state of the art in molecular generation and aroma mapping. |
|
This generative process of our diffusion model expands the chemical space beyond the limitations |
|
of both current olfactory datasets and training methods, enabling |
|
the identification of potential odourant molecules not previously |
|
documented. The generated molecules can then be more accurately validated using advanced olfactory sensors, enabling |
|
them to detect more compounds and inform better hardware |
|
design. By integrating visual analysis, language processing, and |
|
molecular generation, our framework enhances the ability of |
|
olfaction-vision models on robots to accurately associate odours |
|
with their correct sources, thereby improving navigation and |
|
decision-making through better sensor selection for a target |
|
compound in critical applications such as explosives detection, |
|
narcotics screening, and search and rescue. Our methodology |
|
represents a foundational advancement in the field of artificial |
|
olfaction, offering a scalable solution to challenges posed by |
|
limited olfactory data and sensor ambiguities. |
|
|
|
We offer two models with this repository: |
|
- (1) `DEGNN-constrained`: A diffusion model with its associated olfactory conditioner that is constrained to only generate molecules based on the atoms `C`, `N`, `O`, `F`, `P`, `S`, and `Cl`. |
|
- (2) `DEGNN-unconstrained`: A diffusion model with its associated olfactory conditioner that is unconstrained and may generate molecules from any atom. |
|
|
|
--- |
|
|
|
## Model Details |
|
- **Model Name:** `DEGNN Constrained` |
|
- **Developed by:** Kordel K. France |
|
- **Date:** September 2025 |
|
- **Architecture:** |
|
- **Olfaction conditioner:** Feedforward Neural Network |
|
- **Diffusion model:** Equivariant Graph Neural Network conditioned on atoms C, N, O, F, P, S, Cl |
|
- **License:** MIT |
|
- **Contact:** kordel@scentience.ai, kordel.france@utdallas.edu |
|
|
|
--- |
|
|
|
|
|
- **Model Name:** `DEGNN Unonstrained` |
|
- **Developed by:** Kordel K. France |
|
- **Date:** September 2025 |
|
- **Architecture:** |
|
- **Olfaction conditioner:** Feedforward Neural Network |
|
- **Diffusion model:** Equivariant Graph Neural Network conditioned on all available atoms in training data |
|
- **License:** MIT |
|
- **Contact:** kordel@scentience.ai, kordel.france@utdallas.edu |
|
|
|
--- |
|
|
|
## Intended Use |
|
- **Primary purpose:** Research in multimodal machine learning involving olfaction, vision, and language. |
|
- **Example applications:** |
|
- Robotics and UAV navigation guided by chemical cues |
|
- Chemical dataset exploration and visualization |
|
- **Intended users:** Researchers, developers, and educators working in ML, robotics, chemistry, and HCI. |
|
- **Out of scope:** Not intended for safety-critical tasks (e.g., gas leak detection, medical diagnosis, or regulatory use). |
|
|
|
--- |
|
|
|
## Training Data |
|
- **Olfaction data:** Language-aligned olfactory data curated from GoodScents and LeffingWell datasets. |
|
- **Vision data:** COCO dataset. |
|
- **Language data:** Smell descriptors and text annotations curated from literature. |
|
|
|
For more information on how the training data was accumulated, please see the [HuggingFace dataset URL here](https://huggingface.co/datasets/kordelfrance/olfaction-vision-language-dataset) |
|
|
|
--- |
|
|
|
## Directory Structure |
|
|
|
```text |
|
DiffusionGraphOlfactionModels/ |
|
βββ data/ # Example dataset |
|
βββ src/ # Model training and inferenct tools |
|
βββ notebooks/ # Colab-ready notebooks |
|
βββ models/ # Pre-trained models for immediate use |
|
βββ requirements.txt # Python dependencies |
|
βββ LICENSE # Licensing terms of this repository |
|
βββ README.md # Overview of repository contributions and usage |
|
``` |
|
|
|
--- |
|
|
|
## Getting Started |
|
|
|
The easiest way to get started is to open the Colab notebook and begin there. |
|
To explore the model and train locally, follow the steps below: |
|
|
|
#### 1. Clone the Repository |
|
|
|
```bash |
|
git clone https://github.com/KordelFranceTech/Diffusion-Graph-Olfaction-Models.git |
|
cd DiffusionGraphOlfactionModels |
|
```` |
|
|
|
#### 2. Create a Virtual Environment |
|
|
|
```bash |
|
python -m venv env |
|
source env/bin/activate # On Windows: .\env\Scripts\activate |
|
``` |
|
|
|
#### 3. Install Dependencies |
|
|
|
```bash |
|
pip install -r requirements.txt |
|
``` |
|
|
|
#### 4. Run Inference or Train Models |
|
Run inference: |
|
```bash |
|
python scripts/main.py |
|
``` |
|
Train Models: |
|
```bash |
|
jupyter notebook notebooks/Olfaction_Diffusion-Train.ipynb |
|
``` |
|
|
|
--- |
|
|
|
## Citation |
|
|
|
If you use these models in your research, please cite as follows: |
|
|
|
```bibtex |
|
@misc{france2025diffusiongraphneuralnetworks, |
|
title={Diffusion Graph Neural Networks for Robustness in Olfaction Sensors and Datasets}, |
|
author={Kordel K. France and Ovidiu Daescu}, |
|
year={2025}, |
|
eprint={2506.00455}, |
|
archivePrefix={arXiv}, |
|
primaryClass={cs.RO}, |
|
url={https://arxiv.org/abs/2506.00455}, |
|
} |
|
``` |
|
|
|
--- |
|
|
|
|
|
## License |
|
|
|
This dataset is released under the [MIT License](https://opensource.org/license/mit). |
|
|