Job Title: Senior Java PySpark Developer
Location: Irving, TX (ONSITE)
F2F Interview
Full Time ONLY
Job Description:
We are seeking a seasoned Senior Lead Data Engineer with a dual-threat background in robust backend development and large-scale data processing. You are a self-starter who thrives in high-complexity environments and possesses the architectural maturity to take a project from a blank page to a production-ready ecosystem without hand-holding.
Responsibilities:
• Design, develop, and maintain robust, scalable, and high-performance applications using Java and PySpark/Python.
• Lead the architecture and design of complex software systems and ensure adherence to best practices.
• Collaborate with product managers, designers, and other developers to define project requirements and deliverables.
• Optimize and refactor existing codebases for performance, scalability, and maintainability.
• Mentor junior developers and provide technical guidance to the team.
• Integrate third-party APIs, libraries, and tools into applications.
• Write unit tests, perform code reviews, and ensure high code quality.
• Work with DevOps teams to deploy and monitor applications in production environments.
• Stay updated with the latest trends and advancements in Java, Python, and related technologies.
Qualifications:
• 8+ years of professional experience in Java and PySpark/Python development.
• Strong understanding of Python frameworks such as Django, Flask, or FastAPI.
• Expertise in building RESTful APIs in Java and in microservices architecture.
• Solid understanding of software design patterns, algorithms, and data structures.
• Hands-on experience with Big Data environments.
• Familiarity with CI/CD pipelines and version control systems (e.g., Git).
• Strong debugging and problem-solving skills.
• Excellent communication and collaboration skills.
Good to Have Skills:
• Experience with Java and PySpark/Python development, building data processing pipelines and frameworks.
• Knowledge of CI/CD processes for code versioning and deployment across environments.
• Good knowledge of Big Data environments and distributed architecture.
• Knowledge of Snowflake is an added advantage.