The Site Reliability Engineer will ensure the continuous operation of the Linux-based trading infrastructure, providing second-level support, developing automated server management solutions, and collaborating with engineering teams. Responsibilities include responding to emergencies, managing core services, and participating in on-call rotations.
Tower Research Capital, a leading high-frequency proprietary trading firm founded in 1998, seeks a skilled Site Reliability Engineer to join our global SRE Team in Singapore.
Responsibilities
- Overseeing and ensuring the continuous operation of the firm's Linux-based trading infrastructure, addressing day-to-day operational needs
- Providing second-level support, including:
  - Rapid response to emergencies
  - Implementing scheduled updates and deployments
  - In-depth analysis and resolution of performance issues
- Engaging in a rotational on-call schedule, including early morning and weekend shifts, to provide timely support
- Contributing to the development of automated solutions for server provisioning, configuration, and monitoring, with the goal of scalable management of thousands of servers
- Collaborating with the Trading and Core Engineering teams
- Managing essential core services such as DHCP, LDAP, DNS, and NFS for on-prem and hosted data centers as well as public clouds
- Participating in an on-call rotation and occasional weekend shifts
Qualifications
- Sound expertise in Linux production environments
- Basic knowledge of Python and Bash scripting
- Experience with automation and monitoring toolsets
- Comprehensive knowledge of operating system principles, with a particular focus on Linux internals
- Familiarity with Intel-based server hardware and components
- Competence in server-side networking, including understanding network protocols and configurations
- Familiarity with cloud services and architectural solutions
- Experience in designing, building, and troubleshooting complex systems
- Good problem-solving skills, underpinned by a methodical approach to technical challenges. This includes an ability to communicate effectively, demonstrating strong interpersonal skills, a sense of responsibility, and a commitment to driving projects to completion.
- Sense of ownership and drive
Preferred Qualifications
- Involvement in open source or personal projects showcasing a passion for innovation and collaboration
- Experience in High Frequency Trading, Quantitative Finance, or a low-latency environment is advantageous but not a strict requirement
Candidate Attributes
- Organized, responsible, and meticulous
- Strong communicator
- Proactive and willing to take initiative
- Able to manage and prioritize multiple tasks
- Excellent at supporting Linux production environments
- Able to work both within a team and independently
Benefits
Tower continues to enhance the in-house trading system and strategies that have positioned the firm as a leader in the thriving field of quantitative trading. While Tower offers challenges and rewards rivaling those of any Wall Street firm, Tower’s cubicle-free workplace, jeans-clad workforce, and well-stocked kitchens reflect the premium the firm places on quality of life. Benefits include:
- Competitive salary and discretionary bonuses
- 5 weeks of paid vacation per year
- Breakfast, lunch, and snacks on a daily basis
- International medical insurance
- Free gym membership
- For employees ineligible to participate in the CPF, the cash equivalent of the employer’s CPF contribution
- Free events and workshops
- Donation matching program
Tower Research Capital is an equal opportunity employer.
Top Skills
Bash
Python