We're looking for a Senior Platform/Infrastructure Engineer to join our team. You'll play a key role in deploying, maintaining, enhancing, and scaling our high-scale, mission-critical data center production environment, which is essential to our SASE solution. Your responsibilities will include investigating complex production issues across network, OS, virtualization, cloud interconnectivity, and hardware components. You'll also focus on improving system resiliency and monitoring tools, as well as automating methods and procedures to streamline operations and boost stability. If you're a skilled Network/DevOps/Data Center/System engineer with experience in large-scale infrastructure environments, and you're looking to advance your career by joining one of the industry leaders in the evolving cybersecurity field, we'd love to have you on our team.
This job is located in Tel Aviv (hybrid).
Key Responsibilities
Plan, deploy, maintain, and optimize the organization's data center infrastructure, including routers, switches, firewalls, servers, and SAN/NAS units.
Create and maintain Infrastructure as Code (IaC) that deploys, scales, configures, and updates Data Center equipment, encompassing servers, storage, switches, and firewalls.
Monitor network and system performance, detect bottlenecks or connectivity issues, and implement necessary solutions.
Debug issues both in production and in labs. Create, test and validate solutions.
Foster collaboration with cross-functional teams to guarantee network security, uphold data integrity, and adhere to industry best practices and regulatory standards.
Participate in the on-call rotation.
Requirements
Over 5 years of hands-on experience operating enterprise SaaS network and server infrastructures.
3 years of hands-on experience with 24x7x365 production systems.
Experience with Linux routing daemons such as FRR, BIRD, and GoBGP/ExaBGP.
Strong understanding of modern Linux OS functional components and subsystems.
Extensive background in large-scale network/system engineering environments, including ISPs and Cloud Providers.
Hands-on experience with Linux virtualization and containerization technologies, such as KVM, QEMU, Proxmox, and Docker/ECS/K8s.
In-depth expertise in routing protocols, including peering and troubleshooting.
Extensive familiarity with automation, configuration management, and Infrastructure as Code (IaC) tools such as Ansible and Terraform, as well as experience running them from within CI/CD tools such as GitHub Actions and Jenkins.
Proficiency in scripting with Bash, Python, or similar languages.
Experience with observability and monitoring systems such as DataDog, Prometheus, and Grafana.
Proficient in high availability concepts and implementation, including redundancy, failover, and load balancing technologies.
Additional advantage: Experience in firewall administration, covering configuration, maintenance, and troubleshooting of appliances like Checkpoint.
This position is open to women and men alike.