πŸ“ Toronto, Ontario  Β·  Seneca College Β· 2025

Saroj Neupane

Data Engineer Β· IT Specialist  Β·  Cloud
Data Engineering AWS & Azure Python / SQL IT Support Embedded / IoT PCB Design

I'm a Computer Engineering graduate from Seneca College with 8+ years of professional experience in IT support and data infrastructure. I've worked at Orca Intelligence as Senior Data Engineer, Cedar Gate Technologies as Data Engineer, and at Verisk Analytics as Data Engineer.

On the data engineering side, I architect end-to-end pipelines using Python, SQL, and AWS β€” ingesting, transforming, and serving data at scale. I hold 7 certifications including CCNA, CompTIA Security+, AWS Cloud Practitioner, MS Azure, and Google IT Support.

πŸ“ Toronto, Canada
πŸŽ“ Seneca College Β· 2025
πŸ’Ό 8+ yrs Experience
πŸ† 7 Certifications
πŸ“ž 416-731-2534
8+
Yrs Experience
7
Certifications
6+
Projects
Saroj Neupane β€” Data Engineer, Toronto
Available · Toronto 🍁
About Me

Skills & Background

I'm Saroj Neupane, a Data Engineer and Computer Engineering Technician based in Toronto, Ontario. I build scalable data pipelines, cloud infrastructure, and embedded systems β€” bridging hardware and modern software.

My 8+ years of professional data engieering and computer engineering experience at Orca Intelligence Verisk Analytics and Cedar Gate Technologies gave me deep expertise in IT support, Active Directory, Microsoft 365, and enterprise networking. I hold 7 professional certifications and am actively pursuing data engineering and cloud roles in Canada.

Data Engineering
Python / Pandas SQL / PostgreSQL ETL Pipelines Apache Airflow Kafka dbt PySpark Data Warehousing API Integration NumPy
Cloud & Infrastructure
AWS EC2 / S3 AWS RDS / CloudFront AWS IAM / Lambda Microsoft Azure Azure AD CI/CD Pipelines Docker
IT Support & Systems
Active Directory Microsoft 365 ServiceNow Jira Service Mgmt AnyDesk / RDP VPN Troubleshooting MFA / SSO Endpoint Security Apache Kafka Tensorflow
Programming
Python C / C++ C# Bash / Shell SQL HTML / CSS Flask OpenCV
Networking & Operating Systems
TCP/IP & DNS/DHCP Cisco CCNA Wireshark Windows 10/11 & Server Linux / Ubuntu macOS
Hardware & Embedded
Arduino Raspberry Pi PCB Design OrCAD Vectorworks Oscilloscope Hardware Repair
πŸŽ“ Education
Computer Engineering
Seneca College
Toronto, Ontario  Β·  Graduated 2025
Data Engineering Cloud Computing Network Admin PCB Design IT Support
πŸ† Certifications
🌐
Cisco CCNA
Networking & IT Infrastructure
πŸ›‘οΈ
CompTIA Security+
Cybersecurity Fundamentals
☁️
AWS Cloud Practitioner
Amazon Web Services
πŸ”·
MS Azure Fundamentals
Microsoft Azure Certification
πŸͺŸ
MS Cloud Practitioner
Microsoft Cloud Fundamentals
πŸ”§
Google IT Support
Google Professional Certificate
⚑
OrCAD Certification
PCB & Circuit Design
Career

Work Experience

8+ years of professional experience across enterprise IT environments in Canada, USA, and Nepal.

Senior Data Engineer
Orca Intelligence Inc.
πŸ“ London, Ontario, Canada
Feb 2025 – Present
  • Designed and developed scalable ETL/ELT pipelines using Azure Databricks, PySpark, and Azure Data Factory
  • Built enterprise-grade lakehouse architectures using Azure Data Lake Gen2 and Snowflakefor analytics and AI workloads.
  • Developed high-performance Spark jobs processing large-scale healthcare datasets and streaming pipelines using Kafka and Spark Streaming
  • Automated workflow orchestration and SLA management using Apache Airflow DAGs
  • Built CI/CD pipelines using Azure DevOps, Jenkins, Docker, and Terraform
  • Implemented data quality validation frameworks and collaborated with analytics teams for AI/ML data preparation.
Environment: Azure Databricks, PySpark, Spark SQL, Azure Data Factory, Snowflake, Kafka, Airflow, Python, SQL, Docker, Terraform, Jenkins, CI/CD
Data Engineer
Cedar Gate Technologies
πŸ“ Greenwich, Connecticut, USA
July 2022 – Jan 2025
  • Developed scalable batch and incremental ETL pipelines using PySpark and Azure Databricks
  • Implemented automated ingestion workflows using Azure Data Factory and Apache Airflow
  • Designed dimensional data models and Snowflake schemasfor healthcare analytics systems.
  • Built reusable PySpark transformation frameworks and real-time ingestion pipelines using Kafka and Azure Event Hub
  • Optimized Spark workloads using partitioning, caching, and adaptive query execution techniques.
  • Created monitoring and alerting solutions using Grafana and CloudWatch
Environment: Azure Databricks, PySpark, Azure Data Factory, Apache Airflow, Snowflake, Kafka, Event Hub, Spark SQL, Python, SQL Server, Grafana, CloudWatch
Data Engineer
Verisk Analytics
πŸ“ Jersey City, New Jersey, USA
March 2020 – June 2022
  • Developed end-to-end ETL pipelines using AWS Glue, PySpark, and Python for insurance analytics data
  • Built scalable big data processing solutions using AWS EMR, Spark, and Snowflake
  • Automated Airflow workflows for daily and hourly batch processing pipelines.
  • Integrated APIs, relational databases, flat files, and third-party insurance feeds into centralized analytics platforms.
  • Implemented data quality validation, reconciliation, and audit frameworks for enterprise reporting.
  • Optimized AWS cloud resources and Spark workloads to improve performance and reduce infrastructure costs.
Environment: AWS Glue, AWS EMR, S3, Redshift, Snowflake, Apache Airflow, PySpark, Spark SQL, Python, SQL, Kafka, AWS Lambda, Docker, Jenkins
Work

Featured Projects

Engineering projects spanning AI cloud systems, data pipelines, robotics, and embedded systems.

πŸ”„
β˜… Featured
Data Engineering Β· Real-Time
Data Pipeline Project

High-throughput data pipeline for real-time ingestion, transformation, and loading. Features stream processing with error handling, retry logic, data quality checks, and monitoring dashboards on AWS.

Python SQL AWS S3 / RDS Stream Processing Data Quality Orchestration
πŸ₯…
β˜… Featured
Robotics Β· Embedded Systems
Automatic Goalkeeper Robot

Autonomous robot detecting incoming balls via ultrasonic and IR sensors. Arduino C++ firmware achieves sub-120ms reaction time through custom motor driver circuits and multi-sensor fusion. Full hardware design from scratch.

Arduino C++ Ultrasonic Sensor IR Sensor Motor Driver Embedded C

More on GitHub

All source code and additional projects on my public GitHub profile.

View GitHub β†—
Get in touch

Let's work
together

Data engineering, cloud deployment, IT support, or Python automation β€” available for full-time roles, internships, and freelance work.

Connect with me

I respond within 24 hours. Open to opportunities in Canada and remotely.

πŸ“ž
Phone
416-731-2534
βœ‰οΈ
Email
sarojneupane114@gmail.com
πŸ“
Location
Toronto, Ontario, Canada
🌐
Website
sarojneupaneofficial.com
Send a Message
I'll reply to your email within 24 hours.
Message sentβœ…!

Thanks for reaching out. I'll reply to within 24 hours.

πŸ€–
Saroj's Assistant
● Online
Hey! πŸ‘‹ Ask me about Saroj's projects, experience, skills, or contact info!