Job Description
Responsibilities
- Set up and manage cloud-based solution tools and infrastructure
- Develop, deploy, and maintain cloud-based solutions
- Deploy and maintain self-hosted AI models
- Implement automation processes and auto-deployments where possible
- Manage incidents in infrastructure during operation
- Conduct system security assessments
- Monitor system life cycle: onboarding process, roll out updates, releases, etc.
- Perform system reviews and provide improvement recommendations
Minimum Requirements
- Bachelor’s degree in Computer Science, Information Technology, or a related field
- 3+ years of experience in DevOps or a similar role
- Proficiency in working with cloud services, particularly AWS or GCP
- Experience with containerization platforms like Docker
- Familiarity with CI/CD tools and pipelines
- Strong knowledge of Linux/Unix systems
- Experience with scripting languages (e.g., Python, Bash)
- Basic understanding of networking and security principles
- Strong problem-solving skills and attention to detail
- Excellent collaboration and communication skills
Preferred Qualifications
- Experience working with Infrastructure as Code (IaC) tools like Terraform or ARM
- Knowledge of Kubernetes for container orchestration
- Experience working with LLM model serving/inference frameworks
- Proficiency in monitoring, logging, and troubleshooting systems
- Experience with system security assessments and implementations
- Familiarity with agile methodologies
- Relevant certifications (e.g., AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer)
- Experience working in a fast-paced, startup environment
- Knowledge of best practices in DevOps and cloud architecture