Foundational Model for Neurosurgical Video Intelligence
Training a foundation model on 100+ hours of unlabelled neurosurgical video — learning instrument detection, pose, tissue segmentation, and 3D reconstruction directly from footage.
Chapter I · Groundwork
Training a foundation model on 100+ hours of unlabelled neurosurgical video — learning instrument detection, pose, tissue segmentation, and 3D reconstruction directly from footage.
Enhanced a commercial RAG system for data privacy and automatic user personalisation — training a local LLM with a custom user-embedding layer on RLHF to adapt response style to individual personas.
Trading agent using CNNs to fuse visual, textual, and trend-based market signals.
Geospatial monitoring system using dual-camera and autonomous drones.
Scalable searchable document system built with independent Docker containers.
ResNet + CLIP ensemble geolocating images to country-level accuracy.
Automated UML diagram generation from source code using static analysis.
Map-based itinerary builder with real-time suggestions and route visualization.
Real-time age estimation from facial images — reliability-focused deep learning.
Ranking system using multiple specialized models for wildlife auctions.
CNN-based monitoring system identifying regions of interest in industrial video.
Retrieval-augmented generation using word embeddings for minimal hallucinations.
LSTM-powered music generation trained on Jazz corpora to create original melodies.
Intelligent trigger word detection using robust background noise training.
Mathematical image processing for calculating cable car wire spans.
Learning to thread a catheter through anatomy with DQN & imitation learning.
Have something to build, train, or scale?
I'm open to research collaborations, freelance engagements, and full-time roles — especially where medical imaging, foundation models, or RL are involved.