Staff Software Engineer Reddit Remote Job | 100k – 200k USD/year

WhatsApp Group Join Now
WhatsApp Channel Follow Us
1/5 - (1 vote)

Reddit is seeking a seasoned Staff Software Engineer to join their Compute Reliability and Efficiency team. This role is ideal for someone who thrives in the intersection of infrastructure and software engineering, particularly with a strong focus on low-level systems like Linux and Kubernetes. You’ll be at the forefront of solving complex intra-cluster engineering challenges that directly impact Reddit’s global platform’s performance, scalability, and reliability.

As a key member of the team, you will collaborate with fellow engineers to design and implement software solutions that enhance Reddit’s infrastructure. Your work will involve in-depth performance analysis, automation of critical development processes, and the development of innovative solutions using Go, Rust, or Python. This position demands a deep understanding of Linux systems, Kubernetes, and distributed systems, ensuring that Reddit’s infrastructure remains robust and efficient.

Your expertise will not only contribute to the technical evolution of Reddit’s computing platform but will also play a vital role in supporting the company’s mission to serve millions of users worldwide. If you have a passion for tackling large-scale infrastructure problems and a track record of success in systems engineering, this opportunity at Reddit could be the perfect fit for you.

Collaborative Development: Work closely with a team of software engineers to design, develop, and maintain Reddit’s foundational infrastructure platform, ensuring it meets the needs of a globally distributed user base.

Performance and Reliability Analysis: Conduct thorough performance evaluations and reliability assessments on Reddit’s Linux-based Kubernetes fleet, identifying areas for improvement and implementing solutions to enhance system stability and efficiency.

Software Design and Implementation: Write and deliver high-quality software in Go (or other languages such as Rust or Python) to improve the availability, scalability, and latency of Reddit’s Compute Platform.

Infrastructure Automation: Automate critical aspects of infrastructure management, including service creation, cluster upgrades, and other essential operations to streamline processes and reduce manual intervention.

Strategic Contribution: Provide feedback and insights on the technical and strategic direction of Reddit’s compute platform, influencing future developments and enhancements.

Kubernetes Integration: Develop and maintain lower-level Kubernetes integrations, such as CSI drivers and kernel-level observability tools, to support Reddit’s complex multi-cloud, multi-region deployments.

On-Call Responsibilities: Share on-call duties with the Compute team, ensuring rapid response and resolution to any infrastructure issues that may arise.

Troubleshooting: Apply strong troubleshooting skills to resolve issues across different layers of Reddit’s infrastructure, from higher-level orchestration concerns to lower-level runtime environments.

  • Extensive Experience: Minimum of 7+ years in infrastructure engineering with a focus on low-level systems such as Linux, demonstrating a deep understanding of system internals and performance optimization.
  • Programming Proficiency: Strong proficiency in Go (preferred), with additional experience in Rust or Python, enabling you to develop robust, high-performance software solutions.
  • Kernel Knowledge: In-depth understanding of kernel primitives, including groups, namespaces, CPU scheduling, and userspace concerns, as well as packet processing, to ensure efficient system operations.
  • Kubernetes Expertise: Hands-on experience with Kubernetes or similar distributed systems, including the ability to develop on top of these platforms and integrate lower-level components.
  • Troubleshooting Skills: Exceptional troubleshooting capabilities, covering high-level orchestration and lower-level runtime issues, allowing for swift identification and resolution of infrastructure problems.
  • System Design: Proven track record in designing large-scale systems, scoping projects, and building consensus among engineering teams to deliver impactful solutions.
  • Communication Skills: Excellent verbal and written communication abilities to collaborate effectively within a service-oriented team and across the company, ensuring alignment and successful project execution.

Apply Online Through:

https://www.reddit.com/

Sharing Is Caring:

Leave a Comment