Company: Artech LLC
Location:
Pittsburgh, PA – 100% ONSITE
Salary Range:
Flexible for the right candidate
Introduction
We are seeking a professional with strong expertise in Python programming and Big Data technologies to design, develop, and optimize large-scale data pipelines. This role involves working with distributed systems, processing massive datasets, and enabling advanced analytics for business-critical applications.
Required Skills & Qualifications
- Applicants must be able to work directly for Artech on W2
- Strong Python programming for data processing and automation
- Expertise in the Hadoop ecosystem (HDFS, Hive, Pig, MapReduce)
- Experience with Spark for distributed computing
- Proficiency in SQL and NoSQL databases
Preferred Skills & Qualifications
- Strong analytical and problem-solving skills
- Ability to work in cross-functional teams
Day-to-Day Responsibilities
- Design and implement scalable ETL pipelines using Python and Hadoop tools (Hive, Pig, HDFS)
- Automate data ingestion and transformation for structured and unstructured data
- Develop distributed data processing workflows using Spark, MapReduce, and Hadoop
- Optimize performance for high-volume datasets
- Create efficient data models for analytics and reporting
- Implement best practices for data partitioning and indexing in Hadoop
- Monitor and tune Hadoop clusters for optimal performance
- Ensure high availability and fault tolerance of data systems
- Work closely with data scientists and analysts to enable predictive analytics and machine learning
Company Benefits & Culture
- Inclusive and diverse work environment
- Opportunities for professional growth and development
- Collaborative and supportive team culture
For immediate consideration, please click APPLY to begin the screening process with Alex.
