# NarrowMind

A lightweight statistical language model for question-answering and text generation.

NarrowMind combines statistical n-gram modeling with techniques borrowed from modern language models (temperature and top-k sampling) and from classical information retrieval (TF-IDF) to provide fast, memory-efficient language understanding without neural networks.
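
At its core, the model counts which words follow which contexts. A minimal sketch of that idea (the `Bigrams` type and `count_bigrams` function are illustrative names, not NarrowMind's actual API):

```rust
use std::collections::HashMap;

/// Map each word to the words that follow it, with counts.
/// Illustrative sketch, not NarrowMind's actual types.
type Bigrams = HashMap<String, HashMap<String, u32>>;

fn count_bigrams(tokens: &[&str]) -> Bigrams {
    let mut counts = Bigrams::new();
    for pair in tokens.windows(2) {
        *counts
            .entry(pair[0].to_string())
            .or_default()
            .entry(pair[1].to_string())
            .or_insert(0) += 1;
    }
    counts
}

fn main() {
    let tokens: Vec<&str> = "mia was getting ready for school and mia was happy"
        .split_whitespace()
        .collect();
    let bigrams = count_bigrams(&tokens);
    // Continuations observed after "was": {"getting": 1, "happy": 1}
    println!("{:?}", bigrams.get("was"));
}
```

Trigram counts work the same way with a two-word context key; the ensemble then weights the two tables against each other.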

## Features

- **Question Answering**: Understands questions with wildcards (who, what, where, when, why, how); a sketch follows this list
- **Text Generation**: Generates contextually relevant continuations
- **Multi-gram Ensemble**: Weighted combination of bigrams and trigrams
- **TF-IDF Semantic Search**: Finds semantically similar content
- **Temperature & Top-k Sampling**: The same concepts GPT uses for controlled randomness
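
One simple way to realize the wildcard question answering is to drop the question word and match the remaining phrase literally against indexed sentences. A hedged sketch assuming whitespace tokenization (not the crate's actual code):

```rust
/// Treat question words as wildcards and match the remaining phrase
/// against stored sentences. Illustrative sketch, not NarrowMind's code.
const QUESTION_WORDS: [&str; 6] = ["who", "what", "where", "when", "why", "how"];

fn answer<'a>(query: &str, sentences: &[&'a str]) -> Option<&'a str> {
    let rest: Vec<&str> = query
        .split_whitespace()
        .filter(|w| !QUESTION_WORDS.contains(&w.to_lowercase().as_str()))
        .collect();
    let needle = rest.join(" ").to_lowercase();
    sentences
        .iter()
        .find(|s| s.to_lowercase().contains(&needle))
        .copied()
}

fn main() {
    let sentences = [
        "Mia was getting ready for school.",
        "She realized she forgot her homework.",
    ];
    // "who" is dropped; "was getting ready" is matched literally.
    println!("{:?}", answer("who was getting ready", &sentences));
}
```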

## Quick Start

1. Place your training text in `input.txt`
2. Run `cargo run`
3. Ask questions using question words as wildcards:

   ```text
   > who was getting ready
   > what did mia realize
   > quit
   ```

## How It Works

```mermaid
graph LR
    A[Training Data] --> B[Tokenization]
    B --> C[N-gram Stats]
    B --> D[TF-IDF Vectors]
    B --> E[Sentence Index]

    F[User Query] --> G{Direct Match?}
    G -->|Yes| H[Answer]
    G -->|No| I[Power Set + TF-IDF Search]
    I --> J[Top Sentences]
    J --> K[Generate Candidates]
    K --> L[TF-IDF Boost]
    L --> M[Temperature + Top-k]
    M --> H
```
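
The TF-IDF fallback shown in the diagram can be pictured as bag-of-words vectors compared by cosine similarity. A hypothetical sketch with standard TF-IDF weighting and smoothing, not NarrowMind's actual code:

```rust
use std::collections::{HashMap, HashSet};

// weight(term) = tf(term, sentence) * ln(1 + N / df(term)); sentences are
// ranked against the query by cosine similarity.
fn tfidf(words: &[&str], df: &HashMap<&str, usize>, n: usize) -> HashMap<String, f64> {
    let mut v: HashMap<String, f64> = HashMap::new();
    for w in words {
        *v.entry(w.to_string()).or_insert(0.0) += 1.0; // raw term frequency
    }
    for (term, weight) in v.iter_mut() {
        let d = *df.get(term.as_str()).unwrap_or(&1) as f64;
        *weight *= (1.0 + n as f64 / d).ln(); // smoothed inverse document frequency
    }
    v
}

fn cosine(a: &HashMap<String, f64>, b: &HashMap<String, f64>) -> f64 {
    let dot: f64 = a.iter().filter_map(|(k, x)| b.get(k).map(|y| x * y)).sum();
    let norm = |m: &HashMap<String, f64>| m.values().map(|x| x * x).sum::<f64>().sqrt();
    let denom = norm(a) * norm(b);
    if denom == 0.0 { 0.0 } else { dot / denom }
}

fn main() {
    let sentences = [
        vec!["mia", "forgot", "her", "homework"],
        vec!["the", "bus", "was", "late"],
    ];
    // Document frequency: in how many sentences does each word appear?
    let mut df: HashMap<&str, usize> = HashMap::new();
    for s in &sentences {
        for w in s.iter().copied().collect::<HashSet<&str>>() {
            *df.entry(w).or_insert(0) += 1;
        }
    }
    let query = tfidf(&["forgot", "homework"], &df, sentences.len());
    for s in &sentences {
        println!("{:.3}  {:?}", cosine(&query, &tfidf(s, &df, sentences.len())), s);
    }
}
```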

**Algorithm Order:**

1. **Direct Pattern Matching** - fast exact text search
2. **Power Set Matching** - finds sentences matching word combinations
3. **TF-IDF Similarity** - semantic vector search (fallback)
4. **Multi-gram Ensemble** - combines bigrams & trigrams with weights
5. **TF-IDF Relevance Boost** - a 1.0x to 3.5x multiplier for contextually relevant words
6. **Temperature & Top-k Sampling** - controlled randomness (see the sketch after this list)
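
Steps 5 and 6 can be pictured as one scoring-and-sampling pass. The sketch below is illustrative, with assumed names and the rand 0.8 API (only the `rand` dependency itself is confirmed by the README): it raises candidate scores to a temperature exponent, keeps the top-k, and draws a weighted random sample.

```rust
use rand::Rng; // rand 0.8 API

/// Sample the next word from scored candidates. Scores are assumed to
/// already include any TF-IDF relevance boost. Illustrative sketch only.
fn sample_next(mut candidates: Vec<(String, f64)>, temperature: f64, top_k: usize) -> String {
    // Apply temperature: lower values sharpen the distribution (more deterministic).
    for (_, score) in candidates.iter_mut() {
        *score = score.powf(1.0 / temperature);
    }
    // Keep only the top-k highest-scoring candidates.
    candidates.sort_by(|a, b| b.1.partial_cmp(&a.1).unwrap());
    candidates.truncate(top_k);
    // Weighted random draw over the surviving scores.
    let total: f64 = candidates.iter().map(|(_, s)| s).sum();
    let mut roll = rand::thread_rng().gen_range(0.0..total);
    for (word, score) in &candidates {
        roll -= score;
        if roll <= 0.0 {
            return word.clone();
        }
    }
    candidates.last().unwrap().0.clone() // fallback for float rounding
}

fn main() {
    let candidates = vec![
        ("school".to_string(), 3.5), // e.g. boosted 3.5x as contextually relevant
        ("work".to_string(), 1.0),
        ("bed".to_string(), 1.0),
    ];
    println!("{}", sample_next(candidates, 0.8, 20));
}
```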

## Example

Training: "Mia was getting ready for school. She realized she forgot her homework."

```text
Query:    "who was getting ready"
Response: "Mia was getting ready for school."
```

## Configuration

```rust
let mut model = LanguageModel::new(3);
model.set_temperature(0.8);  // Lower = more deterministic
model.set_top_k(20);         // Limit to top 20 candidates
model.train(&training_data);
```

## Requirements

- Rust 1.70+
- Training data in `input.txt`
- Dependencies: the `rand` crate only

## Comparison with GPT

| Feature           | NarrowMind           | GPT            |
| ----------------- | -------------------- | -------------- |
| Architecture      | Statistical n-grams  | Neural network |
| Memory            | ~MBs                 | ~GBs to TBs    |
| Speed             | Instant              | Slower         |
| GPU Required      | No                   | Yes            |
| Temperature/Top-k | ✅ Yes               | ✅ Yes         |
| Semantic Search   | ✅ TF-IDF            | ✅ Embeddings  |

NarrowMind: Think small, understand deeply. 🧠
