Job Description
Important Note Before Submitting Your Resume – Please read the Key Responsibilities and Qualifications sections carefully and only apply if you can demonstrate experience directly mapping to the requirement.
The Project – Altech is working on a very interesting global social initiative, an effort to build systems and tools that will facilitate the Observation, Collection, and Understanding of Social Media data. We are building tools (web apps) and be an international resource to study the information ecosystems that can spur evidence-based policy solutions.
Job Title: Data Scientist
Requirements: US Citizenship and Ability to Clear a Drug and Background Check
Location: Remote
Contract: 10 Months (multi-year project but with an initial funding for a period of performance of 10 months)
Hourly Rate: $60-$65/hour + Full Benefits
The ideal candidate will have a solid foundation in Python programming, extensive experience with distributed systems, and a deep understanding of data storage technologies. You should have practical experience in developing solutions using data lakes and data mesh architectures and be proficient with relational databases and Elasticsearch. This role demands a strong background in tuning data systems for enhanced performance and reliability, as well as development experience with machine learning frameworks such as PyTorch and TensorFlow on both CPU and GPU platforms.
Key Responsibilities:
- Develop and implement advanced data analysis, machine learning, and statistical models to solve complex problems and generate actionable insights.
- Work with distributed systems to manage and process large datasets efficiently.
- Design and optimize data storage solutions, utilizing data lakes and data mesh architectures to facilitate scalable data accessibility and analysis.
- Manage relational databases and Elasticsearch for structured data storage and efficient querying.
- Optimize data systems for improved performance and reliability, ensuring smooth operation and quick access to critical information.
- Utilize PyTorch and TensorFlow for developing and deploying machine learning models on both CPU and GPU targets, focusing on text and image processing applications.
- Extract and preprocess data from various sources, including APIs and web scraping, to feed into analytical models.
- Collaborate with cross-functional teams to integrate data science solutions into broader projects and applications.
Qualifications:
- Proficient in Python, with a strong background in software development and data analysis.
- Experienced with distributed systems and their application in data processing and analysis.
- Solid understanding of data storage technologies, with hands-on experience in data lakes and data mesh architectures.
- Familiarity with relational databases and Elasticsearch.
- Proven track record of tuning data systems for performance and reliability.
- Development experience with PyTorch and TensorFlow, particularly for text and image processing tasks.
- Skilled in extracting data from APIs and web scraping.
- Excellent problem-solving abilities and strong analytical skills.