Hi, I am Shivanshu. A Senior Undergrad at IIT Kanpur
I'm a Software developer and a Computer Vision practitioner with experience in REST APIs, AWS, WebApp development and applications of computer vision. I'm intrested in Computer Vision, Human Vision and the relation between the two as a research topic. I like building things using software and have worked with application development, deep learning applications in [ object, human, speech, emotion, face, action ] detection/recognition. I've trained deep learning models for EdgeNeural.ai and Siemens as an intern and have worked with OpenCV to incorporate speech recognition under GSoC in 2021 and 2022. I'm also a Co-Founder of Unicohub, where I take care of the backend and infrastructure. We're building an ecosystem of tools for creator fan monetization.
I have a deep desire to travel the world and get to experience different cultures. So, If you ever want a travel buddy, HMU.
I love music but don't know how to play properly, still learning!
I want to influence people's lifes with my work and i try to take my decisions with this goal in mind.
In my free time, I used to enjoy swimming and basketball.
Scroll down to learn more about my work and projects
My Work
EdgeNeural.ai
AI Intern
Jan, 2022 - Apr, 2022
Implemented Quantization for different Object classification models. Incorporated MMClassification framework to train and optimize different object classification models. Implemented custom hooks to report training progress on dashboard via REST API from the model training inside a docker on AWS EC2 and save training artifacts to S3 bucket.
OpenCV | GSoC
Student Developer
May, 2021 - Sept, 2021
Pioneered the introduction of speech processing applications in OpenCV by creating a sample for Speech Recognition using DNN & VideoIO modules. Used NVIDIA’s Jasper (arXiv:1904.03288v3) model to create pre-trained onnx file compatible with OpenCV by editing the computation graph. Created pre-processing functions to extract features & decoding functions to produce transcript using jasper. Fixed a matrix conversion issue in the code base allowing users to pass 3D data from python to C++ for computation using C++ DNN functions.
Neuro Match Academy
Computational Neuro Science Summer School
July, 2021 - July, 2021
Worked with 3 other people on a research project to differentiate between the areas of brain responsible for math and language processing using fMRI data from Human Connectome Project. Learnt about the neurons, action potentials and brain. And how do we model these processes computationally. Learnt about various models from basic linear models to bayesian models to Markov Decision processes.
Brain and cognitive Society - IITK
Coordinator
May, 2021 - May, 2022
Managing the daily workings of the society. Recently, concluded the summer projects accomodating 120+ students over 7 projects.
Monetix Ltd.
Web Software Developer
May, 2020 - July, 2020
Extended existing backend written in Typescript to streamline new partner onboarding. Created a conditional beta build system & added new partners’ support in Ionic & Angular app. Automated notification & mail delivery for payment reminders in vanilla JS on Heroku backend. Scripted data migration to & from MongoDB in production equipped with data dumping capabilities.