What you’ll be doing
- Maintain health of cloud-based production environments through monitoring and typical daily administration duties.
- Respond to performance issues identified by alerting and other reported incidents.
- Triage incident/problems in an efficient manner.
- Automate operational activities and tasks.
- Perform occasional after hours support.
- Work with Developers to execute software releases, configuration updates, and other release requirements.
- Complete project work as assigned and contribute to the technical direction of various objectives, your ideas will be valued.
- Demonstrate strong interpersonal and communication skills, while working with diverse audiences including highly technical IT professionals, developers, and operators.
- Demonstrate leadership through personal responsibility, accountability, and teamwork.
Who we think you are
- 3+ years experience in managing Windows & Linux servers, and batch job schedulers.
- Bachelor's degree or equivalent experience
- 3+ years experience in IT Operations, troubleshooting and customer support experience.
- Service Engineering and/or DevOps experience at internet scale involving user data and/or software development for an enterprise level service
- Prior work with Cloud Service providers such as AWS and Google
- Experience with Windows Server, CentOS and Ubuntu servers.
- Experience with production Docker environments
- Familiarity with monitoring requirements
- Configuration Management/Automation
- Familiar with best practices for infrastructure security, reliability, and fault tolerance
- Knowledge of MSSQL, MySQL/PostgreSQL, Linux-based systems, Docker, Monitoring systems, web servers will be an advantage.
- Cloud Computing
- Knowledge of AWS / GCP / Microsoft Azure
- Experience with PowerShell, bash scripts.
- Superior problem solving and troubleshooting skills, an ability to use various data collection tools and methodologies to analyze problems and identify solutions
- Demonstrated understanding on Networking Concepts: DNS, VPN, Virtual Networks etc
- Network Protocols/methods: TCP/IP, HTTP/s, JSON.
- Tools – Zabbix & Airflow
- Basic SQL Server Administration concepts
- Ability to work collaboratively with the technical teams and developers
- Logical and critical thinking