A machine learning project that fine-tunes a T5 model to generate structured summaries of medical research papers from PubMed abstracts.
This project trains a sequence-to-sequence model to automatically generate comprehensive summaries of medical papers including:
- Plain-language summary
- Key findings
- Clinical relevance
- Methodology brief
- Python 3.12 (required; Python 3.13 has compatibility issues with PyTorch on macOS)
- PyTorch
- Transformers (Hugging Face)
- Datasets
- Other dependencies listed below
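The Python dependencies (matching the `pip install` command in the installation steps) can also be pinned in a `requirements.txt`. This is a suggested layout, not a file shipped with the repo:

```
torch
transformers
datasets
evaluate
rouge-score
sentencepiece
accelerate
```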
Using Homebrew (recommended for macOS):
```bash
brew install python@3.12
cd MedicalPaperSummarizer
/opt/homebrew/bin/python3.12 -m venv venv312
source venv312/bin/activate
pip install torch transformers datasets evaluate rouge-score sentencepiece accelerate
```

To train the model:

```bash
# Activate the virtual environment
source venv312/bin/activate

# Run the training script (the entrypoint is `train_model.py`)
python train_model.py
```

The script will:
- Load PubMed articles from `pubmed_abstracts.json`
- Process and structure the abstracts
- Fine-tune the T5-small model
- Save the trained model to `pubmed-summarizer-best/`
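The "process and structure" step can be sketched roughly as follows. This is an illustrative sketch only: the field names (`abstract`, `summary`) and the `summarize:` task prefix are assumptions about what `train_model.py` does, not a copy of it.

```python
# Hypothetical sketch of turning a structured PubMed abstract into a
# T5 source/target pair (field names are assumptions, not taken from
# train_model.py).

def build_example(article: dict) -> dict:
    """Flatten a structured abstract into T5 source/target strings."""
    sections = article["abstract"]  # e.g. {"Background": ..., "Methods": ...}
    source = "summarize: " + " ".join(
        f"{name}: {text}" for name, text in sections.items()
    )
    target = article.get("summary", "")  # reference summary, if present
    return {"source": source, "target": target}

example = build_example({
    "abstract": {"Background": "Statins lower LDL.", "Methods": "RCT, n=200."},
    "summary": "# Plain-language summary\nStatins reduced cholesterol.",
})
```

T5 is trained with task prefixes, so prepending a short instruction such as `summarize:` to the input text is the conventional way to frame summarization for this model family.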
- Model: `google-t5/t5-small`
- Max steps: 5 (for quick testing)
- Batch size: 4
- Learning rate: 5e-5
- Device: CPU (configured for compatibility)
MedicalPaperSummarizer/
├── train_model.py # Main training script (Python 3.12 compatible)
├── get_data.py # Script to fetch PubMed data
├── run_train.sh # Helper script to launch training with env vars set
├── run_training.sh # Alternative launcher with additional macOS tweaks
├── pubmed_abstracts.json # Input data (PubMed articles)
├── pubmed-sum/ # Training outputs
└── pubmed-summarizer-best/ # Final trained model
If you encounter `[mutex.cc : 452] RAW: Lock blocking` errors, you are likely using Python 3.13. This is a known PyTorch bug on macOS. Solution: use Python 3.12 as shown in the installation steps.
Fixed in `train_model.py` by properly handling tensor conversions and clipping values to valid ranges.
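The fix described above is along these lines. This is a hedged sketch, not the actual code from `train_model.py`: the function and parameter names are illustrative, and 32128 is t5-small's vocabulary size. During seq2seq evaluation, label tensors are commonly padded with `-100`, which is not a valid token id, so ids must be sanitized before decoding.

```python
# Illustrative sketch of the kind of fix described above (names are
# hypothetical; the real code lives in train_model.py). Replaces the
# -100 label padding with the pad token id and clips every id into the
# valid vocabulary range before decoding.

def sanitize_token_ids(ids, pad_token_id=0, vocab_size=32128):
    """Replace -100 padding with the pad token and clip to the vocab range."""
    cleaned = [pad_token_id if i == -100 else i for i in ids]
    return [min(max(i, 0), vocab_size - 1) for i in cleaned]
```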
The input data (pubmed_abstracts.json) should contain PubMed articles with structured abstracts including sections like:
- Background/Introduction
- Methods
- Results
- Conclusions
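An input record might look like the following. This is an illustrative sketch; the exact field names produced by `get_data.py` may differ:

```json
{
  "pmid": "12345678",
  "title": "Example trial of drug X",
  "abstract": {
    "Background": "...",
    "Methods": "...",
    "Results": "...",
    "Conclusions": "..."
  }
}
```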
The model generates structured summaries in the following format:
# Plain-language summary
[3-sentence accessible summary]
# Key findings
- [Finding 1]
- [Finding 2]
- [Finding 3]
- [Finding 4]
# Clinical relevance
[Clinical implications and applications]
# Methodology brief
[Brief description of study methodology]
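Because the summary is emitted as plain markdown with `#` headings, it can be split back into sections downstream. A minimal sketch (the section headings match the format above; this parser is not part of the repo):

```python
# Minimal sketch of parsing a generated summary back into its sections.
# Assumes the "# Heading" format shown above; not part of this repository.

def parse_summary(text: str) -> dict:
    """Map each '# Heading' to the body text that follows it."""
    sections, current = {}, None
    for line in text.splitlines():
        if line.startswith("# "):
            current = line[2:].strip()
            sections[current] = []
        elif current is not None:
            sections[current].append(line)
    return {name: "\n".join(body).strip() for name, body in sections.items()}
```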
Feel free to submit issues and pull requests!
MIT License
- Built with Hugging Face Transformers
- Uses Google's T5 model
- PubMed data from NCBI