Gabriel Busto

summary

this is a page i wanted to put together with a list of project ideas, resources, platforms, tools, etc that i’ve come across in case it’s helpful to others. i’ll likely exclude really obvious sites and tools like chatgpt, claude, etc.

books
videos
project ideas
sites and platforms
repositories (coming soon!)

books

Neural Networks from Scratch - a book that is exactly what it sounds like. learn to build neural networks from scratch using pure python.
Deep Learning with Python - haven’t bought this book, but it looks like a great resource

videos / channels

3Blue1Brown - amazing educational videos related to math and similar topics
- Deep Learning / Nueral Networks Playlist - an awesome intro to neural networks and deep learning, and even covers llms and transformers

project ideas

a collection of ideas and things to explore to get hands on experience. i’m a bit sparse on the details for now, but will improve this list over time.

get a very small model running on your own local machine
- you could use ollama, but trying going to huggingface and finding a very small model that is <= 1B params and use the provided example code
train a small model on your machine; you can use Unsloth
same as the above two, but try running and/or training a larger model on a platform like Modal
after getting a model running with a very simple setup (especially a larger one on modal) use vllm for hosting that model and compare how much faster it is. they have a guide for using vllm on modal
get an open source/weights image gen model running locally or remotely
- explore options to speed up inference/diffusion
get an open source/weights audio/speech model running locally or remotely
- explore options to speed up inference/diffusion
build a really simple app using a cool new model like Gemini’s Nano Banana
get an old school state of the art model running; gpt-2 and grok-2 are the first ones that come to mind
train your own small neural network from scratch

sites and platforms

there are SO many. too many to list. but i’ll try and keep it up to date as i remember / learn about more.

Modal - write python code on your machine, and run it remotely on their gpus.
- they have some cool educational resources. they have lots of examples for what you can do on modal, a well written guide on how to use their platform, and a gpu glossary
Hugging Face 🤗 - the ai community. think of it like github, but for ai and ai models.
- companies and individual upload their latest models (literally every kind you can imagine), fine tunes, data sets for training, etc. they also provide infrastructure to deploy and manage your own models and fine tuning. probably one of the best resources out there
Unsloth - an open source framework for llm fine-tuning and reinforcement learning; they try to make it faster and easier.
- they’re also a great learning source on top of their tooling to help speed up model training with lots of guides
Replicate - tons of hosted models (and even fine-tunes) that you can run from an api with pay-as-you pricing easily.
- i think they focus more on image/video models, but they do have some llms on there. you can even do finetuning on there, and deploy your own custom models
Fal - in my mind, this is basically just a competitor of Replicate and they offer the ~same exact thing as far as i can tell (not a bad thing!)
Elevenlabs - an audio ai platform; text to speech (TTS), speech to text (STT), realtime speech, voice cloning, sound effects, music, dubbing, etc
AssemblyAI - focused entirely (at least they used to be) on speech to text.
- they provide amazing state of the art models for fast, cheap, and highly accurate transcription. including fancy features like diarization and recognizing multiple speakers, extracting insights from text, and even more advanced features like filtering out PII.
arxiv - the place where it seems like every paper for ai/ml/math gets published. anytime you see someone talking about a new paper, they’re likely linking to it on arxiv.
Luma Labs - a frontier multimodal (primarily video?) lab.
- they have image generator models, video generator models, and even a 3d model generator
- they recently released their new state of the art video model: Ray3
  - it is STUNNING. and it’s an incredibly cool architecture that is capable of thinking and reasoning in visuals. they have some really neat capabilities too.
Meshy - a state of the art ai 3d platform.
Factory AI - agent-native software development. their agents are called Droids. don’t know much about it, but it looks cool!
Thinking Machines - a new lab started by the former cto of openai (Mira Murati).
- they have a blog with some posts on it; they will likely continue to put out great blog posts
- just released their first product: Tinker. it’s a training api; sounds nice, but in my opinion not all that different than what modal offers. i get the impression it will be a bit more “done for you” than what modal offers.
Together AI - i’m not super clear on what this is, but it seems like a platform to train and host open source models with production-level infrastructure
Fireworks AI - i think it’s ~similar to Together AI. one notable thing about these guys though is that their platform emphasized privacy, and are even HIPPA compliant!
Comfy - an open source, node-baesd app for genai. they have a desktop app you can use to run it locally. i hear a lot about it but have never used it personally
Weavy - a powerful artistic tool that lets you build custom and complex flows with text, image, and video gen models
Weaviate - not to be confused with Weavy! they provide a RAG database service, and also have an open source implementation
Unstructured - helps you transform complex and unstructured data into structured data; i’ve used it before for their advanced chunking strategies prior to saving a RAG database.