ESTABLISHED 2025

VOL. XXIV | ISSUE 42 | Monday, March 24, 2025

Vishal Singh Baraiya

I am a student at IIT Madras, pursuing a BS in Data Science and Applications. Currently, I am working on asynchronous neural networks, exploring novel approaches to enhance efficiency and scalability in deep learning. Alongside AI, I also have experience in web development.

I have successfully reproduced GPT-2, LLaMA, and Mixtral from research papers, deepening my understanding of transformer architectures and large-scale language models. Additionally, I am part of AI4Bharat, contributing to the development of foundation models for India.

At A Glance

  • BS in Data Science, IIT Madras
  • Reproduced GPT-2, LLaMA, and Mixtral from research papers
  • Researching asynchronous neural networks
  • Contributed to 5+ open-source AI frameworks
  • Experienced in building AI wrappers
Leading Data Scientist & AI Engineer

Latest Updates

  • MAY 2023: Vishal started the BS in Data Science and Applications at IIT Madras.
  • MAY 2024: Completed the Foundation Level and started the Diploma Level.
  • MARCH 2025: Started research on asynchronous neural networks.

Featured Projects

Web Interface for Finetuning AI Models
March 2025

This web interface simplifies fine-tuning models by providing an intuitive platform for dataset management, training configuration, and real-time monitoring. Users can customize hyperparameters, track progress, and deploy models seamlessly.

Clone of Bolt and V0
December 2024

This AI-powered web development tool, inspired by Bolt and V0, acts as an AI wrapper that automates website creation. Users input requirements in natural language, and the system generates optimized, production-ready web applications.

GitHub Repo Maintainer
February 2025

This AI-powered GitHub repo maintainer automates bug fixes, feature additions, and code management based on user prompts. It streamlines repository maintenance by analyzing issues, generating solutions, and committing updates autonomously.

Reproduced LLaMA and Mixtral architectures
August 2024

Reproduced the LLaMA and Mixtral architectures from research papers by implementing their core components from scratch, ensuring structural alignment with the original designs. The work focused on replicating the model architecture.


PROJECT OUTPUT OVER TIME

Figure 1: Growth trajectory of research output and machine learning projects over time

Technical Analytics

Skills Assessment

  • Machine Learning: 90%
  • Data Analysis: 85%
  • Deep Learning: 80%
  • Data Visualization: 75%
  • NLP: 70%
  • Computer Vision: 65%
*Based on project experience

Technical Expertise

Programming

  • Python
  • Rust
  • Go
  • C/C++
  • JavaScript

Machine Learning

  • Supervised Learning
  • Deep Learning
  • Natural Language Processing
  • Computer Vision

Data Engineering

  • ETL Pipelines
  • Data Warehousing
  • Big Data (Spark)
  • Cloud Platforms

Tools & Frameworks

  • TensorFlow/PyTorch
  • scikit-learn
  • Pandas/NumPy
  • Docker/Kubernetes

Classified Advertisements

CONTACT INFORMATION

For inquiries, collaborations, or professional opportunities, please reach out through the following channels: