Badges
Certifications
Work Experience
Data Engineer
Bank of America• February 2019 - December 2019
● Enhanced Talend pipelines to improve efficiency and ETL development time by up to 40% ● Introduced architecture improvements to Hadoop cluster increasing data capacity 3x ● Improved machine learning models in DataRobot for 10% higher accuracy ● Participated in bank-sponsored CodeWeek 2019, built out MVP (Python, Flask, Elasticsearch, DataRobot) ● Established CI/CD with Jenkins and Bitbucket across four environments ● Implemented Python scripts to handle incoming error-prone datasets, preventing potential data corruption ● Fulfilled data requests for 10+ data scientists and analysts across 4 internal teams
data engineer
American Water• June 2018 - February 2019
● Monitored both Hadoop and NiFi clusters and maintained cluster security and stability ● Conducted migration of data as well as existing processes from one physical server to another ● Mastered Apache NiFi to build end to end data ingestion pipelines from source databases to company HDFS ● Improved development time of ETL pipelines by up to 50% with Python scripts
Big Data Developer
Cognizant Technology Solutions• February 2018 - June 2018
● Developed, updated and executed shellscripts for data ingestion, cleansing and transformation ● Automated ETL processes with Oozie workflow scheduler ● Used Hive, PigLatin, and PySpark to transform data to business needs
Contractor
Cognizant Technology Solutions• 2017
Education
Rutgers, The State University of New Jersey, New Brunswick