
Master of Science: Computer Science
North Carolina State University
Raleigh, North Carolina, USA | 2022 - 2024
Focused on AI/ML, NLP, cloud computing, and big data analytics.
I am a Data Scientist and Machine Learning Engineer with expertise in AI, NLP, Deep Learning, and Generative AI. My work focuses on developing scalable AI solutions, automating workflows, and deriving insights from large datasets. With hands-on experience in machine learning, deep learning, and cloud-based AI, I have built retrieval-augmented generation (RAG) pipelines, image processing systems, and predictive models for real-world applications.
Currently, I am working on intelligent automation systems and AI-powered applications, leveraging technologies like PyTorch, TensorFlow, Scikit-learn, and Hugging Face for model development, and using Docker, AWS, and GCP for scalable deployments. My recent projects and work experiences include designing an image classification system using Vision LLMs (GPT-4o), optimizing NLP models for syllabus analysis, and creating an AI-driven personalized email outreach system.
With a strong foundation in Python, Go, SQL, and R, I combine data science and software engineering to develop impactful AI-driven solutions. Passionate about problem-solving, I continuously explore new frontiers in AI, cloud computing, and MLOps to build innovative and efficient systems.
View My ProjectsNorth Carolina State University
Raleigh, North Carolina, USA | 2022 - 2024
Focused on AI/ML, NLP, cloud computing, and big data analytics.
Kalinga Institute of Industrial Technology
Bhubaneswar, Odisha, India | 2018 - 2022
Specialized in software development, machine learning, and data structures.
Taral AI
Remote, USA | September 2024 - Present
Worked on designing and implementing a Docker-based image processing system using Go, MongoDB, NATS, and Vision LLMs to classify images as ‘dirty’ or ‘clean’ for automating cleaning task scheduling in schools. Improved database query performance by 40% and built an asynchronous messaging system handling 500+ concurrent connections.
Tech Stack: Docker, Go, MongoDB, NATS, Vision LLMs, Asynchronous Messaging
North Carolina State University
Raleigh, North Carolina, USA / Remote | April 2024 - Present
Leading a study to improve curriculum development by analyzing past syllabus data in the field of biochemistry. Worked on applying transformer models like bioBERT and RoBERTa to analyze 50,000+ university syllabi, designing a Retrieval-Augmented Generation (RAG) pipeline, and uncovering statistical insights into course content distribution.
Tech Stack: bioBERT, RoBERTa, Transformer Models, RAG Pipeline, Python, Pandas
Verzeo
Bhubaneswar, Odisha, India | June 2020 - August 2020
Led a 4-member team in developing an XGBoost regression model for car price prediction, achieving 92% accuracy and a 0.85 R² score. Improved model performance by 15% through feature engineering and selection using LASSO regression. Developed interactive Tableau dashboards for data-driven decision-making.
Tech Stack: XGBoost, LASSO Regression, Tableau, Python, Scikit-learn
Here are some of the projects I have worked on. These projects showcase my skills in data science, machine learning, image processing, and cloud-based solutions.
Built a cold email generator using Python, LangChain, and Llama 3.1 to connect with hiring managers, achieving 95% relevance accuracy by using an embedding-based retrieval system with ChromaDB.
Key Achievements:
Developed a CNN-based solution for classifying crop diseases with 95% detection accuracy. The solution leverages Python, TensorFlow, and Keras to analyze crop leaf images and classify diseases across multiple categories.
Key Achievements:
Built a deep learning-based system for multivariate time series prediction to forecast climate variables like temperature, pressure, and humidity. Utilized LSTM, GRU, and CNN models to accurately predict climate patterns based on historical data from multiple locations.
Key Achievements:
Engineered a highly efficient, large-scale book recommendation system, designed to provide personalized reading suggestions to over 1 million users. Leveraging the power of Apache Spark, MLlib, and PySpark, I processed vast datasets of user ratings and book metadata, delivering lightning-fast, tailored recommendations.
Key Achievements:
Bookipedia is a full-fledged online book store where customers can explore and purchase their favorite books with ease. Built with a robust frontend and backend, this platform ensures seamless browsing, secure transactions, and an intuitive user experience.
Key Achievements:
PlagDetector is an advanced NLP-based project that identifies whether a given text is generated by AI or written by a human. The system leverages transformer models, text summarization techniques, and semantic similarity analysis to detect AI-generated content with high accuracy.
Key Achievements: