This tool analyzes RNA structure prediction data from Sfold and evaluates design candidates for RNA targeting.
https://drive.google.com/drive/folders/1tbaWjBWFPyLOUTGmsu1Eekzj33FgtDuP
Check out running SFOLD to run the sfold backend
Go into sagent-frontend folder and run
npm i
npm run devThe literature on Sfold and RNA sequences is in /literature. We use xtrace to create a knowlege base in vectorstore.ipynb
This repository contains the Sfold suite of tools.
To download the source code and executables, click on the green "Code" button.
To install and load sfold, read the RUNNING_SFOLD file in this directory.
The starmir scripts for binding site prediction and ranking are in the subfolder "STarMir".
These programs only run under Linux. It is probable they could be compiled and run under other *nix operating systems.
The project is comprised of three main parts. The dockerized SFold projet, front end interface, and NEAR agents.
In a traditional approach, it would take approximately 16.57 hours to read through multiple papers and manually extract the relevant information. This process involves skimming, identifying key points, and analyzing content, which is time-consuming and prone to human error.
In contrast, using the NEAR AI Agent, this entire process can be completed in seconds. The AI agent leverages advanced natural language processing techniques to quickly sift through large volumes of text, accurately extracting and summarizing the most pertinent information. This dramatic reduction in time not only boosts productivity but also ensures more reliable and consistent results, highlighting the efficiency and power of AI in research and data analysis.
Some other value adds include repository being dockerized.
-
Install the required dependencies:
pip install -r requirements.txt -
Set your OpenAI API key as an environment variable:
export OPENAI_API_KEY='your-api-key'Alternatively, you can create a
.envfile in the project root with:OPENAI_API_KEY=your-api-keyAnd then load it in your code with:
from dotenv import load_dotenv load_dotenv()
Run the main script from the agent directory:
cd agent
python main.py
By default, the script will:
- Look for Sfold output data in
../output - If not found, it will use dummy data for demonstration
- Evaluate design candidates using OpenAI's API
- Rank the designs based on therapeutic potential
- Generate a recommendation report in
recommendation_report.md
file_parser.py: Parses Sfold output filesdesign_evaluator.py: Evaluates RNA design candidates using LLMranking_agent.py: Ranks designs based on multiple criteriareport_generator.py: Generates a final recommendation reportmain.py: Orchestrates the workflow
You can modify the weights used for ranking in the RankingAgent initialization:
ranking_agent = RankingAgent(
weight_therapeutic=0.4,
weight_research=0.2,
weight_understanding=0.2,
weight_versatility=0.2
)-
Have your Sagent Key in
./sagent/as "sagent-key.pem" Checkout CONNECTING_TO_AWS and SETUP_SSL to setup remote instance -
Install Anaconda
-
Based on NEAR AI GitHub, create NEAR AI account (speaker rec Meteor)
chmod 400 "sagent-key.pem"
ssh -i "sagent-key.pem" admin@ec2-18-216-25-202.us-east-2.compute.amazonaws.com
conda create -n useful_agent python=3.10 -y
conda activate useful_agent
pip install nearai