Systems Engineer

New York, New York

August 2nd 2019




The Systems Engineer incorporates aspects of software engineering and applies them to operations, with the goal of creating ultra-scalable and highly reliable systems. The role is involved in all levels of design, development and delivery of these applications and infrastructure.

Help define the evolving standards and processes that facilitate an efficient and reliable environment, implementing immutable infrastructure within internal datacenters and external cloud providers. Manage software changes and releases, and take responsibility for their implementation.

Responsible for providing 24x7 application and technical support to all production and test environments, as well as application release and configuration management. Pinpoint areas of underdeveloped technology, software practices, and operational procedures that pose a risk to the production environment; actively develop technological solutions and business processes to reduce this risk.


  • Responsible for the Automation of Operations & Support for Trading/Business Systems & Applications

  • Assume a leadership role in the continuous improvement of our Software Deployment Automation (SDA)

  • Research and select products to fulfill system reliability engineering requirements (e.g. Tidal, Puppet, Mesos, Git)

  • Investigate, identify and fix application performance problems, systemic inadequacies and latent reliability issues

  • Troubleshoot stack-wide engineering issues related to hardware, software, network and application

  • Implement Operational Automation for Monitoring, Managing, Deploying and Validating of Systems/Applications

  • Mentor engineers and team members on methodology, standards and best practices

  • Communicate enhancements through internal presentations, blog articles and documentation

Change Management

  • Manage installation and configuration of application software leveraging SDA

  • Manage, schedule and track application changes and releases

  • Define, test, and document installation and fallback of system changes

  • Assess, manage and communicate risks prior to implementation

  • Document specific procedures for implementation and error checking

Implementation Management

  • Design implementation strategy and procedures

  • Deploy application software and configuration changes across all Production and Test environments utilizing SDA and our version control system

  • Ensure proper system and application health checks are performed and run successfully following implementation

Reliability Testing

  • Create and execute operational test plans/scripts/cases based on technical and business requirements

  • Support the execution of acceptance, performance, capacity, regression and system failover testing on back-end components using automated test procedures wherever possible

  • Support scheduled weekend testing of Production level systems and applications

Application Support

  • Provide 2nd level application support for Test and Production environments

  • Troubleshoot and diagnose systems problems and make repairs in areas of responsibility

  • Utilize Jira, GitHub and ServiceNow to track and manage defects discovered during testing and application support

  • Troubleshoot and diagnose systems problems; install and configure fixes for applications

  • Provide online and offline Level II support for the trading systems (24x7 on call)

  • Analyze cause of production problems and develop methods for improvement

  • Define, execute and implement Systems Reliability Standards and Processes

  • Evaluate, design and implement Automation products and initiatives that lead to higher systems/application reliability

  • Define, execute and implement Change Control and Implementation Management for Application functions

  • Define, execute and analyze Operational Acceptance Test initiatives

  • Provide Level II Application Support



  • Expert knowledge of Linux servers, specifically RHEL

  • Demonstrable knowledge of TCP/IP

  • Experience with a high-level scripting language (e.g. Python) and with software design and life cycle

  • Practical knowledge and use of source control systems, preferably Git

  • Experience with IT Automation Software (Puppet, Chef, SaltStack)

  • Knowledge of basic network administration

  • Operational testing, change control and implementation management experience

  • Practical experience with monitoring and alerting workflows and technologies

  • General understanding of capacity and performance management

  • Possess the ability to adapt and adjust to a rapidly changing landscape and priorities

  • Strong ability to work independently and prioritize tasks with little or no direction

  • Passionate lifelong learner and innovator who understands that current technologies and problems will continue to change

  • Familiarity with open source software development culture, community and workflows


  • Bachelor's degree or equivalent knowledge and experience

Job ID: A2457