Visualizing image embeddings in a 2D space through the FiftyOne app
This repository provides tutorials for people at various stages of learning computer vision. It covers foundational concepts, such as image representation and the basics of neural networks, and extends to more advanced topics including generative models and zero-shot learning. The material is designed to be useful for those new to the field as well as practitioners looking to work with contemporary tools and techniques.
Before starting the notebooks, prepare your environment using these setup guides:
- Installing Python 3.11
- Installing FiftyOne with venv (for local development)
- Running inference notebooks inside a Dev Container (for a consistent Dockerized environment)
- Setting up GPU and TPU usage on Kaggle Notebooks
- Setting up a wandb API token and using it on Kaggle Notebooks and Google Colab (for experiment tracking)
- Setting up kagglehub on Google Colab (for accessing Kaggle models and datasets)
- Setting up a HuggingFace token (for accessing models and datasets from Hugging Face Hub)
- Copying data folders without download through Google Drive (a method for large datasets)
This collection of tutorial notebooks is organized to progress from foundational concepts to advanced applications. Click the badges to open them in Google Colab or Kaggle.
For a structured learning experience with quizzes and assignments, enroll in the full course on openHPI:
Video tutorials are available:
- YouTube Playlist
- Review Questions for each video to check understanding.
For questions, tutorial discussions, or project sharing, join the FiftyOne Community Discord. The channel for this content is #practical-computer-vision-workshops.
- FiftyOne Community Discord - Invite Link
- Direct Channel Link (You must accept the invite via the link above first!)
You are welcome to fork this repository, experiment with the code, and contribute. Please maintain the Creative Commons Attribution License.
antonio.rueda.toicen 'at' hpi 'dot' de
This work is licensed under a Creative Commons Attribution 4.0 International License.
Happy learning!