Senior DevOps (Site Reliability Engineer)

JERSEY CITY, NEW JERSEY

About the company:

Founded in 2019 by capital markets veterans, CrossTower is a trading platform on a mission to mainstream digital asset trading. Our digital-asset platform was methodically built for institutional and individual investors with best-in-class safeguards, services and capabilities as well as innovative pricing to make the next-generation financial markets a reality. Because our leadership team has extensive experience building and managing traditional exchanges and structured products, we understand what’s needed to ensure the CrossTower experience is familiar to market professionals.

About the role:

At CrossTower, the Senior DevOps / Site Reliability Engineer (SRE) plays an instrumental role in building and maintaining the infrastructure for a digital asset financial services company. The successful candidate will be expected to work closely with mid-level DevOps (SRE) peers to build modern infrastructure code that meets the Company’s operational demands and serves millions of customers. Our SRE team is responsible for improving the customer experience by continually increasing availability and performance, reclaiming time our engineers spend diagnosing issues or configuring software, and reducing the total cost of owning and operating products and services. We have a cultural foundation built on diversity, inclusion and innovation, and we want you and your ideas to thrive at CrossTower.

What you’ll do:

  • Building and maintaining resilient and scalable production infrastructure
  • Managing a team of DevOps engineers and overseeing task delivery
  • Improving monitoring systems
  • Creating and supporting development automation processes (CI/CD)
  • Participating in infrastructure development
  • Identifying architectural problems and proposing solutions
  • Creating tasks to improve system scalability, performance and monitoring
  • Analyzing product requirements from a DevOps perspective
  • Analyzing and fixing incidents

What you will bring:

  • 5+ years of software engineering experience
  • Experience working with relational databases (PostgreSQL) and the ability to write simple SQL queries
  • Experience working with high-load, zero-downtime environments
  • Experience coding in Python
  • Experience working with monitoring and metrics collection systems (Prometheus, Grafana, Zabbix)
  • Understanding of dynamic routing (OSPF)
  • Experience working with Solarflare
  • Knowledge of and experience working with Cisco network equipment
  • Experience working with Cisco NX-OS
  • Experience working with IPsec, VXLAN, Open vSwitch
  • Knowledge of the principles of the multicast protocols IGMP and PIM
  • Experience configuring multicast on Cisco equipment
  • Understanding of distributed systems principles
  • Understanding of the principles for building a resilient network infrastructure
  • Experience with Linux administration (Debian-based distributions are a plus)
  • Strong knowledge of Bash
  • Experience working with LXC containers
  • Understanding of and experience with the infrastructure-as-code approach
  • Experience developing idempotent Ansible roles
  • Experience working with Git

Required Technology Stack:

Linux, Bash, Ansible, LXC, libvirt, IPsec, VXLAN, Open vSwitch, OpenVPN, OSPF, BIRD, Cisco NX-OS, Multicast, PIM, LVM, software RAID, LUKS, PostgreSQL, Nginx, HAProxy, Prometheus, Grafana, Zabbix, GitLab, Capistrano

How to Apply:
If you are interested in joining our team, please send your resume and cover letter to [email protected]
