CrawlJobs Logo

Database Reliability Engineer

tiki.vn Logo

TIKI

Location Icon

Location:
Vietnam , Ho Chi Minh

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

As part of Tiki’s Database team, you will be responsible for managing and optimizing our large-scale database infrastructure to ensure the stability, data reliability, integrity, security, and performance of all TIKI services. You will handle database provisioning, performance tuning, high availability, backup/recovery strategies, designing scalable database architectures, access control, and incident troubleshooting to keep our data systems healthy 24/7. Our Systems handle high volumes of transactions and terabytes of data across PostgreSQL, MySQL, MongoDB, ClickHouse, ScyllaDB, and more.

Job Responsibility:

  • Design, implement, and optimize database schemas, indexes, and queries to improve performance, scalability, and reliability
  • Automate provisioning, configuration, access control, and schema deployment using Ansible, Terraform, and Git
  • Monitor database performance metrics, proactively identify bottlenecks, and troubleshoot incidents to ensure system stability and minimize downtime
  • Collaborate closely with developers, DevOps, and infrastructure teams to align database changes with application requirements and optimize SQL queries and access patterns
  • Review and automate database changes execution, deployment processes, user creation, and system permissions
  • Implement and manage backup, recovery, replication, and disaster recovery strategies
  • Proposing cost-effective solutions
  • Set up and manage database partitioning and indexing strategies to efficiently handle large data volumes
  • Conduct regular security audits, patching, and compliance assessments to maintain database security and integrity
  • Automate routine DBA tasks and implement monitoring solutions to ensure database health and availability
  • Support Change Data Capture (CDC) pipelines using Kafka Connect or similar tools
  • Apply Infrastructure-as-Code and CI/CD practices for version-controlled database configurations
  • Participate in on-call rotations and promptly respond to database-related emergencies outside business hours
  • Document database configurations, procedures, and troubleshooting guidelines for knowledge sharing and compliance

Requirements:

  • 3+ years operating *nix systems in production (CentOS, Rocky, Ubuntu, Debian)
  • 3+ years managing databases in large-scale environments
  • Expertise in at least one major RDBMS (PostgreSQL or MySQL)
  • Deep understanding of internals - storage engines, indexing, replication, and transactions
  • Proven skills in performance tuning and capacity planning
  • Familiarity with high-availability, scaling, and Kubernetes-based deployments
  • Experience with observability tools (Prometheus, Grafana)
  • Exposure to CDC, Terraform/Ansible, and Git-based CI/CD workflows
  • Understanding of networking fundamentals (TCP/IP, routing, firewalls)
  • Strong troubleshooting & problem-solving mindset
  • open and collaborative
  • Available for off-hour support when needed

Nice to have:

  • Experience with NoSQL (Cassandra, ScyllaDB) and data streaming (Kafka)
  • Familiarity with data warehouses (BigQuery) or cloud DB services (Cloud SQL, RDS/Aurora)
  • Exposure to hybrid / multi-environment setups

Additional Information:

Job Posted:
January 10, 2026

Expiration:
February 10, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Database Reliability Engineer

Database Reliability Engineer

The Database Reliability Engineer (DBRE) is responsible for managing, building, ...
Location
Location
United States
Salary
Salary:
120000.00 - 179000.00 USD / Year
pointclickcare.com Logo
PointClickCare
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of experience working with relational database systems
  • Strong hands-on experience with MySQL (administration, performance tuning, replication, HA/DR)
  • 1+ years in a DBRE or database-focused engineering role
  • Experience working in cloud environments (AWS, GCP, or Azure — Azure preferred)
  • Coding and automation experience (Python, PowerShell, SQL, etc.)
  • Experience with Infrastructure-as-Code tools such as Ansible and Terraform
  • Experience working with source control systems such as Git
  • MySQL experience preferred
  • PostgreSQL is a plus
  • Experience working with VLDBs (1+ TB) and managing large database fleets (100+ instances)
Job Responsibility
Job Responsibility
  • Managing, building, maintaining, monitoring, and troubleshooting the cloud-based MySQL database infrastructure that our mission-critical SaaS application depends on
  • Focuses heavily on automation and coding to reduce operational toil
  • Collaborate closely with Engineering and SRE teams to support new product development and ensure reliable database integration across the platform
  • Work on observability of MySQL database metrics and ensure database performance and reliability objectives are consistently met
  • Work with the DBA team to identify areas of operational toil and implement automations/processes to manage PCC’s MySQL database systems at scale
  • Apply a data-driven approach to performance tuning, availability improvements, and operational optimization
  • Provide database support to Engineering and SRE teams, including review of database migrations, query performance, schema/design improvements, and standardizing MySQL configuration and deployment patterns
  • Assist the DBA team with performance troubleshooting and root-cause analysis
What we offer
What we offer
  • Benefits starting from Day 1!
  • Retirement Plan Matching
  • Flexible Paid Time Off
  • Wellness Support Programs and Resources
  • Parental & Caregiver Leaves
  • Fertility & Adoption Support
  • Continuous Development Support Program
  • Employee Assistance Program
  • Allyship and Inclusion Communities
  • Employee Recognition … and more!
  • Fulltime
Read More
Arrow Right

Database Reliability Engineer - Core Team

We are committed to providing our customers with reliable and secure services at...
Location
Location
United Kingdom
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science or a related field
  • At least 5 years of experience in Reliability Engineering, QA or customer facing engineering
  • Previous experience operating ClickHouse or other SQL databases in production
  • Excellent understanding of distributed database internals and SQL, particularly ClickHouse is a major plus
  • Scripting experience with Shell or Python, and ability to read and understand C++ code
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
  • You are a strong problem-solver and have solid production debugging skills
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability
  • Excellent communication skills
Job Responsibility
Job Responsibility
  • Continuously improve the reliability and performance of ClickHouse core
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers
  • Dig deeper into the most common problems encountered by customers in Clickhouse Core to identify the root cause of problems and submit bug fixes, issue reports and suggest improvements
  • Enhance and refine incident response processes and post-mortem analysis for ClickHouse core related outages including working with support and Cloud teams to communicate to the impacted customers
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right

Database Reliability Engineer

We are committed to providing our customers with reliable and secure services at...
Location
Location
Netherlands
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science or a related field
  • At least 5 years of experience in Reliability Engineering, QA or customer facing engineering
  • Previous experience operating ClickHouse or other SQL databases in production
  • Excellent understanding of distributed database internals and SQL, particularly ClickHouse is a major plus
  • Scripting experience with Shell or Python, and ability to read and understand C++ code
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
  • You are a strong problem-solver and have solid production debugging skills
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability
  • Excellent communication skills
Job Responsibility
Job Responsibility
  • Continuously improve the reliability and performance of ClickHouse core
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers
  • Dig deeper into the most common problems encountered by customers in Clickhouse Core to identify the root cause of problems and submit bug fixes, issue reports and suggest improvements
  • Enhance and refine incident response processes and post-mortem analysis for ClickHouse core related outages including working with support and Cloud teams to communicate to the impacted customers
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Fulltime
Read More
Arrow Right

Database Reliability Engineer

We are committed to providing our customers with reliable and secure services at...
Location
Location
Germany
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science or a related field
  • At least 5 years of experience in Reliability Engineering, QA or customer facing engineering
  • Previous experience operating ClickHouse or other SQL databases in production
  • Excellent understanding of distributed database internals and SQL, particularly ClickHouse is a major plus
  • Scripting experience with Shell or Python, and ability to read and understand C++ code
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
  • You are a strong problem-solver and have solid production debugging skills
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability
  • Excellent communication skills
Job Responsibility
Job Responsibility
  • Continuously improve the reliability and performance of ClickHouse core
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers
  • Dig deeper into the most common problems encountered by customers in Clickhouse Core to identify the root cause of problems and submit bug fixes, issue reports and suggest improvements
  • Enhance and refine incident response processes and post-mortem analysis for ClickHouse core related outages including working with support and Cloud teams to communicate to the impacted customers
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right

Database Reliability Engineer

We are looking for a skilled and motivated Database Reliability Engineer to join...
Location
Location
United States , Irvine; Los Angeles
Salary
Salary:
130000.00 - 150000.00 USD / Year
viantinc.com Logo
Viant
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2–5 years of experience in database administration in production environments
  • Experience with relational databases such as MySQL, PostgreSQL, or SQL Server
  • Hands-on exposure to AWS (e.g., RDS, Aurora) and/or GCP (e.g., Cloud SQL, BigQuery)
  • Experience with Linux systems and cloud monitoring tools (e.g., CloudWatch, Stackdriver)
  • Proficient in scripting (e.g., Bash, Python) and automation tools
  • Familiar with CI/CD and infrastructure automation (e.g., Terraform, GitHub Actions, Jenkins)
  • Hands-on experience with Grafana and Prometheus for database and infrastructure monitoring
  • Understanding of backup and recovery strategies, replication, and high availability
  • Basic knowledge of performance tuning and monitoring tools (e.g., EverSQL)
  • Strong analytical and troubleshooting skills
Job Responsibility
Job Responsibility
  • Database Maintenance and Operations - Maintain database health by managing backups, replication, and routine maintenance tasks across environments (e.g., MySQL, PostgreSQL, SQL Server)
  • Cloud Database Support - Assist with administration of cloud-based databases such as AWS RDS, Aurora, DynamoDB, and Google Cloud SQL, ensuring reliability and performance
  • Monitoring and Alerting - Set up and maintain monitoring and alerting systems using Prometheus and Grafana, as well as cloud-native tools (e.g., CloudWatch, Stackdriver) to proactively detect and resolve database issues
  • Performance Tuning - Collaborate with senior DBAs and developers to optimize queries, indexes, and configurations for better performance
  • Automation and Scripting - Automate recurring tasks using scripts and contribute to deployment pipelines and database change management processes
  • Security and Access Management - Implement role-based access controls, audit trails, and enforce best practices for data security and compliance
  • Documentation and Support - Document database configurations, procedures, and incident reports. Provide support during incidents and collaborate with engineers to troubleshoot issues
What we offer
What we offer
  • fully paid health insurance
  • paid parental leave
  • unlimited PTO
Read More
Arrow Right

Database Reliability Engineer IV

PagerDuty is seeking a proficient Senior Database Reliability Engineer (DBRE) IV...
Location
Location
United States , Atlanta
Salary
Salary:
150000.00 - 252000.00 USD / Year
https://www.pagerduty.com Logo
PagerDuty
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in SRE, DBRE, or Software Development
  • 3+ years experience with database management systems such as MySQL, PostgreSQL, DynamoDB, Cassandra, etc.
  • Experience in one or more of the following languages like Ruby, Python, or Golang
  • Experience working on cloud-native infrastructure in AWS
  • Experience working with a container scheduler platform, preferably Kubernetes
Job Responsibility
Job Responsibility
  • Partner with Engineering stakeholders to design and deliver reliable, scalable, secure, and performant data platforms
  • Continuously strive to improve the customer experience: Full lifecycle support (creation, development, deployment, retirement), observability, flexible connectivity, and monitoring
  • Stay current on technology trends in order to deliver innovative tools and approaches to interesting problems
  • Share your expertise with the entire Engineering organization
  • Participate in a 24/7 on-call rotation
What we offer
What we offer
  • Competitive salary
  • Comprehensive benefits package
  • Flexible work arrangements
  • Company equity
  • ESPP (Employee Stock Purchase Program)
  • Retirement or pension plan
  • Generous paid vacation time
  • Paid holidays and sick leave
  • Dutonian Wellness Days & HibernationDuty - companywide paid days off in addition to PTO
  • Paid parental leave: 22 weeks for pregnant parent, 12 weeks for non-pregnant parent
  • Fulltime
Read More
Arrow Right

Database Engineer / DBA (Database Migration)

NorthBay is seeking talented and motivated Database Engineers who are passionate...
Location
Location
Pakistan , Lahore / Islamabad / Karachi
Salary
Salary:
250000.00 - 350000.00 PKR / Month
northbaysolutions.com Logo
NorthBay
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Up to 3 years of relevant experience in database deployment, management, and migration
  • Hands-on experience with PostgreSQL, MongoDB/DocumentDB, and AWS DMS preferred
  • Knowledge of Backup/Recovery, Clustering, Sharding, Replication, and CDC concepts
  • Familiarity with NoSQL databases (especially DocumentDB) is a plus
  • Understanding of data pipelines, ETL/ELT processes, and data engineering principles is an advantage
  • Experience in database monitoring, troubleshooting, performance diagnostics, and optimization
  • Strong understanding of OLTP and OLAP workloads
  • Excellent problem-solving and technical communication skills
Job Responsibility
Job Responsibility
  • Collaborate with technical leads to plan and execute database migrations successfully
  • Evaluate, design, and implement effective migration strategies ensuring data integrity and reliability
  • Understand database structures, schemas, dependencies, and security configurations (RBAC, Backup/Recovery)
  • Develop and execute validation plans to ensure post-migration performance and accuracy
  • Work closely with application and infrastructure teams to ensure business functionality and SLAs are achieved
  • Provide recommendations for performance tuning and optimization of databases
What we offer
What we offer
  • Competitive salary and performance-based benefits
  • Fuel expense reimbursement
  • Paid holidays and vacation
  • Medical outpatient reimbursement and health insurance coverage
  • A collaborative, high-performance work environment with opportunities to make a meaningful impact
  • Fulltime
Read More
Arrow Right

Senior Database Engineer

We’re looking for a skilled Data Reliability Engineer to join our team for a cli...
Location
Location
United States
Salary
Salary:
Not provided
zoolatech.com Logo
Zoolatech
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in Data Engineering, Database Reliability, or Infrastructure Operations
  • Strong expertise in PostgreSQL on AWS, including tuning, replication, backups, and HA configurations
  • Experience operating RDBMS databases (PostgreSQL, MySQL, etc.) and Kubernetes technologies is highly desirable
  • Experience provisioning and operating NoSQL databases at scale like Elasticsearch, Elastic Cache, DynamoDB, Neo4j, Mongo, Cassandra, etc.
  • Advanced SQL scripting and query optimization skills
  • Experience with data systems monitoring, alerting, and performance tuning
  • Strong programming/scripting in Java, Python, or Shell
  • Proven experience in designing or supporting complex data ecosystems
  • Solid understanding of cloud infrastructure (preferably AWS) and Infrastructure as Code tools (Terraform)
  • Familiarity with event streaming platforms (Kafka), and observability stacks (New Relic, ELK, etc.)
Job Responsibility
Job Responsibility
  • Own and optimize the reliability, availability, and performance of data infrastructure across production systems
  • Lead the design and implementation of resilient, secure, and observable data systems
  • Collaborate with SRE, Security, and Engineering teams to enforce data infrastructure standards and align on architectural decisions
  • Design and implement automation around provisioning, uptime monitoring, data refresh, integrity, backups, and disaster recovery
  • Support application developers with performance tuning, complex query optimization, and database design reviews
  • Analyze and resolve performance bottlenecks and incidents with a focus on long-term solutions
  • Participate in on-call rotation to support production systems and ensure high availability
  • Actively contribute to improving incident response and observability through metrics, alerting, and runbooks
  • Work with technologies such as Java, Ruby on Rails, PostgreSQL, AWS, Kafka, S3, Elasticsearch
What we offer
What we offer
  • Paid Vacation
  • Sick Days
  • Floating Holidays
  • Sport/Insurance Compensation
  • English Classes
  • Charity
  • Training Compensation
  • Fulltime
Read More
Arrow Right