Vision-based Personal Memory Assistant

I built this prototype to explore how AI could help with personal memory recall using visual data. Think of it as a "digital memory" system that captures images throughout your day and lets you search through them using natural language.

What I Built

This is a proof of concept for a wearable-style memory assistant that:

Captures images periodically via webcam (simulating smart glasses)
Detects objects using YOLOv8 to understand what's in each scene
Stores memories with timestamps and detailed descriptions
Lets you search using natural language like "When did I last see my keys?"
Provides a web interface to browse and query your visual memories
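
To make the "stores memories with timestamps and detailed descriptions" step concrete, here is a minimal sketch of how detector output could be turned into a stored record. The function name, the (label, confidence) input shape, and the record fields are assumptions for illustration, not the project's actual code.

```python
from collections import Counter
from datetime import datetime


def build_memory_record(detections, image_path, captured_at, min_confidence=0.5):
    """Turn raw detections into a storable memory record.

    `detections` is assumed to be a list of (label, confidence) pairs;
    this shape is hypothetical and not taken from the project.
    """
    # Keep only detections at or above the confidence threshold.
    kept = [label for label, conf in detections if conf >= min_confidence]
    counts = Counter(kept)
    # Sort labels so the description is stable across runs.
    parts = [f"{count} x {label}" for label, count in sorted(counts.items())]
    if parts:
        description = "Scene contains: " + ", ".join(parts)
    else:
        description = "No objects detected"
    return {
        "timestamp": captured_at.isoformat(timespec="seconds"),
        "image_path": image_path,
        "objects": sorted(counts),
        "description": description,
    }


record = build_memory_record(
    [("laptop", 0.91), ("person", 0.88), ("person", 0.64), ("cup", 0.31)],
    "data/images/example.jpg",
    datetime(2024, 1, 15, 9, 30),
)
print(record["description"])
```

The low-confidence "cup" detection is dropped here, so it never reaches the description or the searchable object list.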

Why I Built This

I was curious about how we could use AI to augment human memory. The idea came from thinking about how often I misplace things or forget where I was at a certain time. What if I could just ask my computer "When did I last see my laptop?" and get an instant answer with a photo?

How It Works

The system uses a hybrid approach:

Local processing for object detection (YOLO) - keeps things fast and private
Cloud AI (ChatGPT) for understanding natural language queries
SQLite database to store everything locally
Streamlit web app for easy interaction
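
As a rough illustration of the local storage piece, the sketch below uses Python's built-in sqlite3 module to store captures and answer a "when did I last see X" lookup. The table and column names are assumptions made up for this example; the real schema in memory/database.py may differ.

```python
import sqlite3


def open_store(path=":memory:"):
    """Open (or create) the store. Table names here are hypothetical."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS memories (
               id INTEGER PRIMARY KEY AUTOINCREMENT,
               timestamp TEXT NOT NULL,      -- ISO 8601, so text order is time order
               image_path TEXT NOT NULL,
               description TEXT NOT NULL
           )"""
    )
    conn.execute(
        """CREATE TABLE IF NOT EXISTS memory_objects (
               memory_id INTEGER NOT NULL REFERENCES memories(id),
               label TEXT NOT NULL
           )"""
    )
    return conn


def add_memory(conn, timestamp, image_path, description, labels):
    """Insert one capture plus one row per detected object label."""
    cur = conn.execute(
        "INSERT INTO memories (timestamp, image_path, description) VALUES (?, ?, ?)",
        (timestamp, image_path, description),
    )
    conn.executemany(
        "INSERT INTO memory_objects (memory_id, label) VALUES (?, ?)",
        [(cur.lastrowid, label) for label in labels],
    )
    conn.commit()
    return cur.lastrowid


def last_seen(conn, label):
    """Return (timestamp, image_path) of the newest capture containing label."""
    return conn.execute(
        """SELECT m.timestamp, m.image_path
           FROM memories AS m
           JOIN memory_objects AS o ON o.memory_id = m.id
           WHERE o.label = ?
           ORDER BY m.timestamp DESC
           LIMIT 1""",
        (label,),
    ).fetchone()


conn = open_store()
add_memory(conn, "2024-01-15T09:30:00", "data/images/a.jpg", "desk scene", ["laptop", "keys"])
add_memory(conn, "2024-01-15T14:05:00", "data/images/b.jpg", "kitchen scene", ["cup", "keys"])
add_memory(conn, "2024-01-15T18:40:00", "data/images/c.jpg", "sofa scene", ["laptop"])
print(last_seen(conn, "keys"))
```

Storing timestamps as ISO 8601 text keeps the "most recent" query a plain ORDER BY, with no date parsing in SQL.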

Features

🔍 Natural Language Search

Ask questions like:

"When did I last see my keys?"
"Show me when I was working at my desk"
"Find memories from today"
"When was I in the kitchen?"

📸 Smart Image Capture

Automatically captures images every 5 minutes during active hours
Uses YOLO to detect objects and understand scenes
Stores everything with timestamps and metadata
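
The capture rule above (a fixed interval, only during active hours) can be sketched as a single predicate. The function name and signature are hypothetical; the defaults mirror the values shown in the Configuration section (300 seconds, 8 to 22).

```python
from datetime import datetime, timedelta


def should_capture(now, last_capture, interval_seconds=300, start_hour=8, end_hour=22):
    """Decide whether a new image should be captured at time `now`."""
    # Outside the active window [start_hour, end_hour): never capture.
    if not (start_hour <= now.hour < end_hour):
        return False
    # Nothing captured yet in this session: capture immediately.
    if last_capture is None:
        return True
    # Otherwise wait until the full interval has elapsed.
    return now - last_capture >= timedelta(seconds=interval_seconds)


morning = datetime(2024, 1, 15, 9, 0)
print(should_capture(morning + timedelta(minutes=4), morning))
print(should_capture(morning + timedelta(minutes=5), morning))
```

Keeping the decision in a pure function like this makes it easy to test without a camera or a running scheduler.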

📊 Memory Analytics

Browse all your captured memories
See statistics about your daily patterns
Track what objects you interact with most
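
For a sense of how these statistics could be computed, here is a small sketch over in-memory records. The record shape (a dict with "timestamp" and "objects" keys) and both function names are assumptions for illustration only.

```python
from collections import Counter
from datetime import datetime


def object_frequencies(memories, top_n=3):
    """Count how many captures each object label appears in."""
    counts = Counter()
    for memory in memories:
        # Use a set so an object counts once per capture, not once per detection.
        counts.update(set(memory["objects"]))
    return counts.most_common(top_n)


def captures_per_hour(memories):
    """Map hour of day to number of captures, ordered by hour."""
    hours = Counter(datetime.fromisoformat(m["timestamp"]).hour for m in memories)
    return dict(sorted(hours.items()))


sample = [
    {"timestamp": "2024-01-15T09:30:00", "objects": ["laptop", "cup"]},
    {"timestamp": "2024-01-15T09:35:00", "objects": ["laptop", "keys"]},
    {"timestamp": "2024-01-15T14:05:00", "objects": ["laptop", "cup", "cup"]},
]
print(object_frequencies(sample))
print(captures_per_hour(sample))
```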

Tech Stack

I chose these technologies for a good balance of performance and ease of use:

Python - Easy to prototype and has great AI libraries
YOLOv8 - Fast, accurate object detection
ChatGPT API - Natural language understanding
SQLite - Simple, local database
Streamlit - Quick web interface
OpenCV - Image processing

Getting Started

Prerequisites

Python 3.8+
Webcam
OpenAI API key (for natural language queries)

Quick Setup

# Clone the repo
git clone https://github.com/yourusername/MemoryAssistant.git
cd MemoryAssistant

# Run the setup script
python setup.py

# Add your OpenAI API key to .env file
# Edit .env and add: OPENAI_API_KEY=your_key_here

# Start the app
streamlit run app.py

Manual Setup

# Install dependencies
pip install -r requirements.txt
python -m spacy download en_core_web_sm

# Create .env file
cp env_example.txt .env
# Edit .env and add your OpenAI API key

# Run tests
python test_setup.py

# Start the app
streamlit run app.py

Tech Stack by Layer

Vision: YOLOv8 + OpenCV
NLP: ChatGPT API + spaCy
Frontend: Streamlit
Backend: FastAPI
Database: SQLite + FAISS (vector search)
Storage: Local file system


Project Structure

MemoryAssistant/
├── app.py                 # Main Streamlit application
├── capture/               # Image capture system
│   ├── camera.py         # Webcam integration
│   └── scheduler.py      # Periodic capture logic
├── vision/               # Computer vision processing
│   ├── detector.py       # YOLO object detection
│   └── processor.py      # Scene understanding
├── memory/               # Memory storage and retrieval
│   ├── database.py       # SQLite database operations
│   ├── storage.py        # File storage management
│   └── query.py          # Natural language query processing
├── api/                  # ChatGPT API integration
│   └── openai_client.py  # OpenAI API wrapper
├── ui/                   # User interface components
│   └── components.py     # Streamlit UI components
├── config/               # Configuration files
│   └── settings.py       # Application settings
└── data/                 # Data storage
    ├── images/           # Captured images
    ├── database/         # SQLite database
    └── embeddings/       # Vector embeddings

Usage

Start the app: streamlit run app.py
Capture some images: Click "Capture Image Now" in the sidebar
Search your memories: Try queries like "When did I last see my phone?"
Browse recent captures: Check the "Recent Captures" tab
View statistics: See your memory patterns in the "Statistics" tab
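
To illustrate what happens behind the search step, the sketch below reduces a question such as "When did I last see my phone?" to a structured filter. This is a simple keyword-based stand-in, not the project's method: the app itself sends queries to the ChatGPT API. The label vocabulary, function name, and output fields are all hypothetical.

```python
import re
from datetime import date

# Hypothetical vocabulary of object labels the store knows about.
KNOWN_LABELS = {"keys", "laptop", "phone", "cup"}


def parse_query(text, today):
    """Map a question to a structured filter using simple keyword rules."""
    # Lowercase and split into alphabetic words, dropping punctuation.
    words = re.findall(r"[a-z]+", text.lower())
    return {
        # Object labels mentioned in the question, in order of appearance.
        "labels": [w for w in words if w in KNOWN_LABELS],
        # "last" signals that only the most recent match is wanted.
        "latest_only": "last" in words,
        # "today" restricts results to the current date.
        "on_date": today.isoformat() if "today" in words else None,
    }


today = date(2024, 1, 15)
print(parse_query("When did I last see my phone?", today))
print(parse_query("Find memories from today", today))
```

A rule-based parser like this only covers phrasings it was written for, which is one reason to hand free-form questions to a language model instead.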

Demo

You can run the demo without an API key:

python demo.py

This shows the system working with sample data.

Configuration

Edit the .env file to customize:

# Capture settings
CAPTURE_INTERVAL=300          # Capture every 5 minutes
CAPTURE_ACTIVE_HOURS_START=8  # Start at 8 AM
CAPTURE_ACTIVE_HOURS_END=22   # Stop at 10 PM

# AI settings
OPENAI_MODEL=gpt-3.5-turbo    # Use GPT-3.5 for faster responses
CONFIDENCE_THRESHOLD=0.5      # Object detection confidence
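
As a sketch of how these values might be read and validated at startup, the example below builds a typed settings object from environment variables. The Settings class and load_settings function are hypothetical names, not the contents of config/settings.py; the variable names and defaults are the ones listed above.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class Settings:
    capture_interval: int        # seconds between captures
    active_start: int            # first active hour (24h clock)
    active_end: int              # hour at which capture stops
    openai_model: str
    confidence_threshold: float  # minimum detection confidence


def _clean(raw):
    # Drop a trailing "# comment" in case the .env loader kept it.
    return raw.split("#", 1)[0].strip()


def load_settings(env=None):
    """Build Settings from a mapping (defaults to os.environ)."""
    env = os.environ if env is None else env

    def get(key, default):
        return _clean(env.get(key, default))

    settings = Settings(
        capture_interval=int(get("CAPTURE_INTERVAL", "300")),
        active_start=int(get("CAPTURE_ACTIVE_HOURS_START", "8")),
        active_end=int(get("CAPTURE_ACTIVE_HOURS_END", "22")),
        openai_model=get("OPENAI_MODEL", "gpt-3.5-turbo"),
        confidence_threshold=float(get("CONFIDENCE_THRESHOLD", "0.5")),
    )
    # Fail early on values that would make the scheduler or detector misbehave.
    if settings.capture_interval <= 0:
        raise ValueError("CAPTURE_INTERVAL must be positive")
    if not 0 <= settings.active_start < settings.active_end <= 24:
        raise ValueError("active hours must satisfy 0 <= start < end <= 24")
    if not 0.0 <= settings.confidence_threshold <= 1.0:
        raise ValueError("CONFIDENCE_THRESHOLD must be between 0 and 1")
    return settings


example = load_settings({
    "CAPTURE_INTERVAL": "600   # every 10 minutes",
    "CONFIDENCE_THRESHOLD": "0.7",
})
print(example)
```

Passing a plain dict, as in the example, keeps the loader testable without touching the real environment.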

What I Learned

Building this taught me a lot about:

Hybrid AI approaches - Local processing for speed, cloud for intelligence
Privacy considerations - Keeping sensitive data local while using cloud AI
User experience - Natural language makes AI much more accessible
Rapid prototyping - Streamlit is amazing for quick demos

Future Ideas

If I continue this project, I'd like to add:

Activity-aware capture scheduling instead of a fixed interval
Cloud storage for backup and sharing
Mobile app companion
Voice queries for hands-free use
Activity prediction to anticipate what you might be looking for

Troubleshooting

Camera Issues

Make sure your webcam is connected and accessible
On macOS, you'll need to grant camera permissions
Try a different camera index in the .env file

OpenAI API Issues

Verify your API key is correct
Check that your OpenAI account has credits
Ensure you're using a supported model

Performance Issues

Use yolov8n.pt (nano) model for faster processing
Reduce capture frequency in .env
Close other applications using the webcam

Contributing

This is a personal project, but I'm open to ideas and improvements! Feel free to:

Report bugs
Suggest new features
Submit pull requests

License

MIT License - feel free to use this code for your own projects.

Built with ❤️ to explore the future of AI-assisted memory


