Role : Python and PySpark, No SQL Developer
Location : MT. Laurel, NJ (ONSITE)
FTE ONLY
Job Description
Must Have Technical/Functional Skills
• 7+ years of software or data engineering experience.
• Strong programming experience with Python (Pandas, NumPy preferred).
• Hands-on experience with PySpark for distributed data processing.
• Deep expertise with MongoDB (NoSQL operations, pipelines, indexing strategies).
• Strong proficiency in SQL with the ability to write complex queries and optimize performance.
• Experience working with large datasets and scalable big data pipelines.
• Familiarity with cloud platforms (AWS/Azure/GCP) is an advantage.
• Strong analytical and problem solving skills.
• Excellent communication and teamwork capabilities.
Roles & Responsibilities
• Develop and maintain data pipelines using Python and PySpark.
• Design and optimize ETL processes for large-scale data workflows.
• Work extensively with MongoDB for schema design, querying, indexing, and data modeling.
• Write complex SQL queries, stored procedures, and performance-tuned datasets.
• Collaborate with data engineers, analysts, and business teams to understand requirements.
• Ensure data quality, consistency, and governance throughout the development lifecycle.
• Troubleshoot and resolve performance issues across Python, Spark, and database layers.
• Participate in Agile ceremonies and contribute to solution architecture discussions.
Generic Managerial Skills, If any
• Knowledge of CI/CD tools (Git, Jenkins, Azure DevOps).
• Experience with data lakes, Delta Lake, or Lakehouse architectures.
• Exposure to containerization (Docker, Kubernetes).