This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We’re looking for a Cloud Operations Engineer I to join our Infrastructure Team and help support the backbone of how we deliver for our clients. You’ll work hands-on in our AWS environments every day, helping provision and tune systems, digging into performance issues, and supporting the integrations and configurations that keep projects moving through implementation, testing, and launch. You’ll work closely with our Service Delivery and Product teams, jumping in wherever needed to make sure environments are stable, secure, and ready for the next milestone. If you’re someone who picks things up quickly, follows runbooks but isn’t afraid to suggest improvements, and can juggle ongoing and incoming requests with a calm, organized approach while using AI tools to troubleshoot, automate, and work smarter, you’ll thrive here. At SpryPoint, our reputation is built on the reliability of our products and the quality of our client experience, and our Infrastructure team plays a big part in making that happen.
Job Responsibility:
Support incoming requests for environment provisioning, IP whitelisting, domain whitelabelling, integrations, access, database refresh coordination, and general troubleshooting through Jira Service Management
Provision and maintain AWS environments using Elastic Beanstalk, EC2, ECS, RDS PostgreSQL and Aurora Serverless v2, DynamoDB, Route 53, VPC networking, and S3
Investigate infrastructure and application performance issues using observability tools, logs, metrics, and Linux-level debugging
Review database metrics and connection patterns to identify performance concerns and escalate deeper SQL or indexing issues to development teams when needed
Support project teams through onboarding, testing cycles, mock go-lives, and scheduled production go-lives
Take part in scheduled maintenance work including patching, scaling, certificate updates, and configuration adjustments
Help maintain and tune monitoring and alerting to ensure issues are identified proactively
Assist with incident investigations and contribute to follow-up actions and root cause analysis when issues impact client environments
Build or extend automation using Python or Bash to streamline recurring operational workflows
Work with NGINX logs, service logs, and application logs to help identify and resolve issues across the stack
Document workflows, environment procedures, and runbooks clearly in Confluence to support consistency and knowledge sharing across the team
Keep accurate operational records in Jira, including change management updates, time tracking, and environment notes, as part of our delivery and compliance processes
Apply security best practices when managing environments, including IAM permissions, patching, access controls, and compliance considerations
Participate in routine backup validation, environment restores, or DR practice exercises when needed
Communicate clearly with internal teams and occasionally with client-facing teams to help unblock issues and keep delivery moving
Use AI tools to speed up troubleshooting, improve documentation, and enhance overall operational efficiency
Requirements:
Experience with AWS services such as EC2, Elastic Beanstalk, ECS, RDS PostgreSQL, Route 53, VPC networking, and S3
Comfort working in Linux environments with strong troubleshooting instincts
Comfort working with relational databases, ideally PostgreSQL, including reviewing metrics, understanding connection patterns, and spotting early indicators of performance issues
Practical scripting experience with Python and Bash
Ability to balance and prioritize a steady stream of requests across multiple active projects
Clear, concise communication skills and a collaborative approach with project teams
Curiosity, adaptability, and motivation to learn new tools and technologies
Interest in using AI tools to accelerate learning, troubleshoot efficiently, and enhance day-to-day workflows
Nice to have:
Experience with observability or monitoring platforms
Familiarity with NGINX and reverse proxy configuration
Background supporting project-based or client-delivery teams
Awareness of cloud cost considerations and environment optimization
What we offer:
Remote-first environment with flexible working hours across North America
Competitive Total Rewards - Comprehensive compensation package that grows with you
Complete Setup - MacBook + $500 to create your ideal home workspace
Total Wellness - Health, dental, vision, and life insurance from day one
Generous PTO, Summer Friday half-days, and unlimited sick days
RRSP (Canada) and 401k (US) matching programs
$2,500 annual development fund, tuition assistance, and Book Bounty program
Annual company events and team offsites that bring us together