I am Neethu Mariya, a results-driven data scientist and data engineer with a passion for leveraging data to drive meaningful insights and solutions. With a solid foundation in machine learning, statistical analysis, and data engineering, I have honed my skills through practical experience and academic pursuits, including a master's degree in Data Science from University of Waterloo. Here, you'll find a showcase of my work, demonstrating my expertise in solving real-world challenges and delivering impactful results.
Picked up the skill over years through actual coding and getting my hand dirty through various projects.
Specialized in Machine Learning and Neural network through projects and course-work
What? Why? What's next? is how I get my brains around a problem. And solving them has always been my thing.
I can grasp a new concept over short span of time and I keep motivating myself through learning new things on a continous basis.
Skilled and trained at creating excellant visualizations with perfect balance between perception and cognition.
Designed as an optimization problem called the Vertex-Cover problem, this project is aimed at minimizing the installations of security cameras in streets for effective monitoring. Implemented using the graph theory and reduced it to CNF-SAT solvable format. This is a project with multiple coding assignments written in Python and C++, that communicates with each other via Inter-Process Communication(IPC). The project implements multi-threading and parallel processing to run more efficiently.
see Project
A Literature Review on the state-of-the-art Crowd Counting techniques using CNN. This 15-min-read not only gives the gist of the existing techniques, but also compares them with each other and gives a detailed analysis that highlights the improvements made by each authors from the previous ones. The document concludes by giving an insight on the open problems that are yet to be solved.
see Project
This project aims to analyze the quality of the Portuguese 'Vinho Verde' wine to build a model to predict the quality of the two variants(red and white) of the wine in terms of the selected variables in the best possible way.
see Project
This project aims to give a detailed step-by-step analysis of the time series data, collected from the lacity.org website in an attempt to analyze its web traffic pattern. Here, we have come up with a descriptive analysis and a predictive model. You could find the detailed report of the project in my github repository, where we have addressed to the weekly cycle pattern and outliers.
see Project
This is an ML project that classifies e-commerce products to 27 categories. The data includes categorical features, noisy text description and noisy images for each product. The python code is trained to make use of both the text and the images to accurately classify the products. It makes use of Recurrent Neural Net(RNN) with LSTM units to train the text description and ResNet model train the noisy images and finally ensemble learning techniques to combine the individual predictions. This model classifies the products with 94% accuracy. The data is available at https://www.kaggle.com/c/uw-cs480-fall20/data.
see Project