Duration: 
 
 
  6 months to start 
   Job Description 
 
 
 
- We are seeking a skilled and proactive Dataiku Administrator to manage and optimize our Dataiku DSS platform deployed in an AWS cloud environment.
- The ideal candidate will have hands-on experience with Apache Spark, cloud-native architecture, and enterprise data governance.
- This role is critical to ensuring the stability, performance, and scalability of our analytics infrastructure, which supports a wide range of data science and business intelligence initiatives.
Platform Administration:
- Manage and maintain the Dataiku DSS platform in AWS, including upgrades, patching, and configuration.
- Monitor system health, performance, and resource utilization across Spark clusters and Dataiku nodes.
- Implement and maintain user access controls, roles, and permissions in alignment with SOX and other compliance requirements.
- Configure and tune Spark execution environments for optimal performance within Dataiku workflows.
- Troubleshoot Spark-related job failures and performance bottlenecks.
- Collaborate with data engineers and scientists to optimize Spark recipes and pipelines.
- Cloud Infrastructure Management:
- Work closely with DevOps and Cloud Engineering teams to manage EC2 instances, S3 buckets, IAM roles, and networking components.
- Develop and maintain automation scripts for platform monitoring, user provisioning, and job scheduling.
- Integrate with logging and alerting tools (e.g., CloudWatch, ELK, Prometheus) to proactively detect and resolve issues.
- Provide technical support and training to Dataiku users across departments.
- Act as a liaison between data teams and IT to ensure platform alignment with organizational goals.
- Strong expertise in Apache Spark, including performance tuning and troubleshooting.
- Hands-on experience with AWS services (EC2, S3, IAM, CloudFormation, etc.).
- Experience with Kubernetes, EMR, or other Spark orchestration tools.
- Proficiency in Python, Bash, and/or other scripting languages.
- Familiarity with CI/CD pipelines and infrastructure-as-code tools (e.g., Terraform, GitLab CI).
- Experience administering Dataiku DSS in a production environment, or
- Dataiku certification (Administrator or Advanced Designer).
- Knowledge of Dataiku plugin development and API integration.
- Experience supporting enterprise-scale analytics platforms.
- Understanding of data governance, security, and compliance frameworks (e.g., SOX, GDPR).

