Bullish

Senior Engineer, Site Reliability Engineering

Reposted 6 Days Ago

Be an Early Applicant

In-Office

2 Locations

Senior level

In-Office

2 Locations

Senior level

The Senior Site Reliability Engineer will enhance service reliability and efficiency through automation, monitoring, and incident management while collaborating with development teams.

The summary above was generated by AI

The Bullish Group has built an ecosystem focused on developing financial services for the digital assets sector through technology and investment businesses. These include: Bullish Exchange - digital asset trading services that utilize central limit order matching and proprietary market making technology to deliver deep liquidity and tight spreads within a compliant framework. The business is licensed by the Hong Kong Securities and Futures Commission, German Federal Financial Supervisory Authority, and the Gibraltar Financial Services Commission. Since its launch in November 2021, Bullish Exchange has surpassed US$1.3 trillion in total trading volume, with 2H 2024 average daily volume exceeding US$2 billion. Bullish Capital - an investment company which offers strategic capital, industry expertise and an extensive network of resources to support initiatives that connect conventional finance with the revolutionary possibilities of the digital economy. CoinDesk - an award-winning media, events, indices and data business servicing the global crypto economy.

Reports to:

Manager, Site Reliability Engineering

We are seeking a skilled and proactive Site Reliability Engineer to join our team. As an SRE, you will be responsible for maintaining and improving the reliability, scalability, and efficiency of our services. You will work closely with operations and development teams to ensure our systems are robust and performant.

Role & Responsibilities:

System Reliability: Ensure the reliability and availability of our services by implementing best practices and monitoring solutions. Demonstrate reliability with failure injection and testing failovers.
Automation: Automate operational tasks and processes to enhance efficiency and reduce manual intervention. Really embrace the idea of treating “operations as a software problem”.
Monitoring & Performance: Develop and maintain monitoring solutions to track system health and performance. Analyze metrics and logs to identify and resolve issues proactively.
Incident Management: Participate in on-call rotations and respond to incidents, ensuring timely resolution and conducting post-mortem analysis to prevent recurrence.
Collaboration: Work with development and operations teams to integrate reliability into the software lifecycle and provide guidance on best practices.
Capacity Planning: Monitor system capacity and performance data to ensure our infrastructure can scale to meet future demands.
Continuous Improvement: Identify areas for improvement in our systems and processes and implement solutions to enhance reliability and performance.

Experience & Qualifications:

Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
5+ years of experience in a site reliability engineering or operations role.
Proficiency in scripting languages (e.g., Python, Bash) for automation tasks.
Experience with cloud platforms (e.g., AWS, Google Cloud, Azure) and containerization technologies (e.g., Docker, Kubernetes).
Strong understanding of Linux/Unix systems and networking.
Experience with CI/CD pipelines and version control systems (e.g., Git).
Good experience with monitoring and logging tools (e.g., Datadog, Prometheus, Grafana, ELK stack, Otel).
Strong problem-solving skills and attention to detail.
Excellent communication and collaboration skills.
Ability to work in a fast-paced, dynamic environment.
Debugging / Root Cause Analysis

Preferred Qualifications:

Experience with infrastructure as code tools (e.g., Terraform, Ansible).
Knowledge of database systems and data management.
Experience with microservices architecture and distributed systems.

Bullish is proud to be an equal opportunity employer. We are fast evolving and striving towards being a globally-diverse community. With integrity at our core, our success is driven by a talented team of individuals and the different perspectives they are encouraged to bring to work every day.

Top Skills

Ansible

AWS

Azure

Bash

Datadog

Docker

Elk Stack

Git

GCP

Grafana

Kubernetes

Otel

Prometheus

Python

Terraform

Similar Jobs

Crypto.com

Senior Software Engineer

11 Days Ago

Hybrid

Hong Kong

Senior level

Fintech • Financial Services • Cryptocurrency • NFT • Web3

Design, develop, and maintain software for scalable applications. Ensure reliability and performance, lead SRE initiatives, and engage in system design reviews.

Top Skills: AWSDatadogDockerGithub ActionsGitopsGoKubernetesOpentelemetryRubySpaceliftTerraform

JPMorganChase

Securities Services - Global Custody Product Management - Associate

58 Minutes Ago

Hybrid

Kwun Tong, HKG

Senior level

Financial Services

This role involves managing the Global Custody product strategy and client relations, developing business cases, and leading product initiatives in a high-pressure environment.

Top Skills: PowerPoint

JPMorganChase

Business Resiliency Associate, Markets Operations

58 Minutes Ago

Hybrid

Kwun Tong, HKG

Senior level

Financial Services

Responsible for ensuring business resiliency for the Markets sector in Asia Pacific, managing compliance with operational resiliency regulations and improving stakeholder readiness.

What you need to know about the Singapore Tech Scene

The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.