Infrastructure Engineer

Role

We're looking for an Infrastructure Engineer to join our team and lead the design and maintenance of scalable infrastructure to support our desktop application, website, and backend services. You'll be responsible for ensuring high availability, performance, and stability as we scale to handle millions of users and billions of records.

As our team is small and flat, you'll be solely responsible for making key infrastructure decisions and ensuring our systems can handle the demands of real-time AI interactions and massive data processing.

What You'll Do

You'll collaborate directly with the founder (CTO) and engineering team. Here's some of the things you'll do:

  • Design, build, and maintain scalable infrastructure to support Stroke's desktop app, website, and backend services, ensuring high availability and performance
  • Optimize API endpoints to deliver fast and reliable responses, even under high load, to support real-time AI co-creator interactions
  • Set up and manage databases to store and retrieve over 100 million records and various files (user content, AI-generated outputs) efficiently
  • Implement caching mechanisms (Redis) and optimize query performance to reduce latency and improve user experience
  • Ensure secure storage and retrieval of AI response data, user history, and other critical information
  • Monitor system performance, identify bottlenecks, and implement solutions to maintain stability and uptime
  • Collaborate with the engineering team to deploy updates and new features with minimal disruption to users
  • Work with large datasets containing millions and billions of records, ensuring efficient processing and retrieval
  • Implement background synchronization systems for seamless user experience across devices

Tech Stack

  • Cloud infrastructure: AWS (EC2, S3, RDS, Lambda)
  • Backend: Convex for real-time data synchronization
  • Databases: PostgreSQL for complex queries and data relationships
  • Caching: Redis, Memcached for performance optimization
  • API development: REST, GraphQL
  • Monitoring and logging: Datadog, Prometheus, ELK Stack
  • Scripting: Python, Bash for automation
  • Git, CI/CD pipelines, and automated testing frameworks
  • Bonus: Experience with containerization (Docker, Kubernetes) and CI/CD pipelines (GitHub Actions, Jenkins)

In some places the tech stack could be changed. If you have ideas about that, please include them in your application letter.

What We're Looking For

  • Strong experience with cloud infrastructure and scalable system design
  • Proficiency in database optimization and working with large datasets
  • Experience with caching strategies and performance optimization
  • Comfortable working independently with minimal oversight
  • Results-oriented mindset with focus on system reliability and performance

How to Apply

To apply, email us at [email protected] with the following info:

  • Your resume
  • Most challenging infrastructure or scalability problem you resolved and how you tackled it

If there appears to be a fit, we'll get back to you. The final step is a collaborative interview where you'll work on a small infrastructure-related project with the team.