Skip to content
View YuvrajSingh-mist's full-sized avatar

Block or report YuvrajSingh-mist

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
YuvrajSingh-mist/README.md

Hi 👋, Myself Yuvraj Singh

A passionate AI/ML developer inclined towards NLP and CV (Multimodality). Aspire to pursue research abroad in the same domain

  • 🔭 I’m currently working on Fine-tuning LLMs and Research Paper implementations

  • 🌱 I’m currently learning about Reinforcement Learning Techniques and its intersection with LLM's outputs

  • 🤝 I’m looking for RE/RS intern or FTE roles, preferably in NLP and Computer Vision

  • 📫 How to reach me yuvraj.mist@gmail.com


🛠️ Languages and Tools :

Java  React  Spring  Material UI  Flutter  CSS  Firebase 

🔥 My Stats :

Yuvraj Singh's Streak

[Yuvraj's github activity graph]

Pinned Loading

  1. Paper-Replications Paper-Replications Public

    A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch

    Jupyter Notebook 373 40

  2. SmolLlama SmolLlama Public

    So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset form HuggingFace consisting of 15 M texts (10BT snapshot) f…

    Python 15 4

  3. StoryLlama StoryLlama Public

    Trained a Llama a 88M architecture I coded from ground up to build a small instruct model, going through the below-mentioned stages from scratch. Trained on TiyStories dataset form HuggingFace cons…

    Python 6 1

  4. SmolMixtral SmolMixtral Public

    So, I trained a MoE based a 124M (8x12M) architecture I coded from ground up to build a small instruct model, going through the below-mentioned stages from scratch. Trained on TiyStories dataset fo…

    Python 11 1

  5. SmolWhisper SmolWhisper Public

    Trained a Whisper model a ~30M (whisper tiny.en) architecture I coded from ground up to build a small ASR model, going through the below-mentioned stage from scratch. Trained on GigaSpeech dataset …

    Python 7 1

  6. Reinforcement-Learning-From-Scratch Reinforcement-Learning-From-Scratch Public

    Repository of implementations of classic and sota rl algorithms from scratch in PyTorch

    Python 185 18