
About me
Hey there! I am Shazia Parveen, and I am thrilled to have you here. As a Master of Software Engineering graduate from Carnegie Mellon University, I have developed a strong foundation in Software Development, Cloud Computing, AI Engineering, and Software Project Management.
Whether it is enhancing business processes, improving user interactions, or pushing the boundaries of technology, I'm ready to take on the most challenging puzzles.
From crafting insurance solutions to architecting data syndication services and developing a movie recommendation system, my diverse portfolio reflects a commitment to applying Software Engineering principles to real-world challenges.
In this ever-evolving world, I believe in the power of continuous learning. Each project, each line of code, and every constructive discussion is an opportunity to refine my skills and expand my knowledge base.
Please view my detailed resume here
Skills
Other Skills: C, C++, Scala, Golang, Rust, Git, Github, Boto3, Git, Pandas, Flask, PySpark, Pytest, Django, Jenkins, CircleCI, Azure DevOps, Jira, Selenium, Cypress, Jest, Postman, JaCoCo, PITest, Figma, Latex, Code Documentation, Distributed systems, Database design.
WORK EXPERIENCE
My Journey in Software Development
PROJECTS
A Gallery of my Notable Projects. Please visit my Github.
Web Development
Data Syndication Service Development and Integration
Led a team of five professionals to seamlessly integrate a data syndication service within the Surefront platform, boosting product listings by 30% and increasing monthly sales revenue by 10% for our clients.
Skills: Python, Django, ReactJS, AWS Lambda, GitHub, CircleCI, Project Management, Risk Management, Quality Assurance, Quality Control, Requirements Gathering, System Design, System Architecture, System Integration, System Testing, System Deployment, System Maintenance
(in collaboration with Surefront)
Data Analysis
Iterative Processing with Spark on a Twitter Social Graph Big dataset
I used Spark for Data Exploratory Analysis on a Twitter Social Network dataset. Then, I implemented and ran the PageRank algorithm on the Twitter dataset to find the most influential users.
Skills: Scala, Zeppelin for Apache Spark, Spark on HDInsight, Databricks, Yarn, Spark UI
AI Engineering
Machine Learning in Production
In the course of this project, I undertook the tasks of implementing, evaluating, operating, monitoring, and evolving a recommendation service tailored for a movie streaming platform akin to Netflix.
Data: 1 million Customers, 27K Movies
Skills: Python, Jupiter Notebook, DVC, Github, Kafka, Minikube, Docker, Load Balancer, FastAPI, MySQL, CircleCI, Grafana, Prometheus
Data Analysis
Exploratory Data Analysis
Utilized Python for data manipulation, analysis, and insight extraction from various complex data formats. Conducted database queries and web scraping to compile a comprehensive list of names for machine learning-driven text analysis. Employed Pandas for ETL tasks, data cleaning, integration, and correlation analysis on sports-related Wikipedia data.
Skills: Python, Panda, ETL, Web Scraping
Web Development
Food Ordering App
Developed a user-friendly food ordering app, uniting React frontend with a Java backend, featuring dynamic menus, order placement, and seamless integration for a smooth customer experience.
Skills: Java, PostgreSQL, JavaScript, CSS, HTML
Cloud Computing
Stream Processing with Kafka and Samza
In this project, I implemented a Kafka Producer to generate streams of data based on provided trace files. These streams simulated real-time driver location updates and client ride requests, which were processed on a remote AWS Samza cluster. Additionally, I wrote Samza code to consume these streams and implemented an algorithm to match clients with drivers based on their scores and proximity, resulting in the successful creation of a driver matching service.
Skills: EC2, Samza cluster, EMR, Kafka, TDD, Terraform