Junior Site Reliability Engineer, GNC (Falcon)
Jobright.ai

Jobright is an AI-powered career platform that helps job seekers discover the top opportunities in the US. We are NOT a staffing agency. Jobright does not hire directly for these positions. We connect you with verified openings from employers you can trust.
Job Summary:
SpaceX is actively developing the technologies to enable human life on Mars. The Site Reliability Engineer, GNC will operate and scale mission-critical products for Guidance, Navigation and Control, working closely with the GNC team to maintain and improve GNC-focused tools.
Responsibilities:
• Deploy, upgrade, operate/maintain, and scale a suite of mission-critical GNC products and services
• Provision and maintain virtual and physical servers
• Work with SpaceX HPC team to monitor and maintain a 4000+ thread HPC cluster
• Closely collaborate with GNC software engineers to create highly operable and maintainable products
• Add monitoring for web apps and respond to outages
• Manage the underlying computational infrastructure of GNC in collaboration with IT
• Engage in and improve the whole lifecycle of services: from inception and design, through deployment, operation and refinement
• Make recommendations for future hardware purchases
• Practice sustainable incident response and postmortems
• Provide end-user support to GNC engineering for products by becoming an expert on analysis applications and support users in troubleshooting and pointing to features
• Configure automated deployment pipelines for web apps
• Develop or improve GNC web apps and tools for better usability, maintainability, and robustness
• Demo and document new software changes such as operating system upgrades, shared filesystem changes, or major tool rollouts
• Focus on performance bottlenecks and performance improvement techniques
Qualifications:
Required:
• Bachelor’s degree in computer science, information systems/IT, engineering, math, or scientific discipline and 2+ years of software development experience OR 4+ years of professional experience building software with site reliability or DevOps in lieu of a degree
• Experience with Linux operating systems
• Experience with Python and Python based development frameworks
Preferred:
• 2+ years of systems administration, site reliability engineering, or DevOps experience
• 2+ years of experience with Python and Python-based development frameworks
• 2+ years of Linux experience
• Expertise with Docker, Vagrant, and Kubernetes or similar technologies
• Extensive Experience with configuration management tools such as Ansible, Puppet, Terraform
• Experience with build systems (Make, Bazel / Pants / Buck, Gradle) and package management tools (pip, npm)
• Strong understanding of virtualization and hypervisor technologies
• Understanding of databases and data modeling
• Experience with automatically managing dozens or hundreds of servers
• Strong networking knowledge of TCP/IP
• Experience scaling web applications and optimizing applications for performance
• Professional experience with standard front-end technologies like modern HTML, CSS, JavaScript (we use AngularJS, Polymer, Backbone.js, React, and more), REST, JSON
• Solid understanding of UI/UX design to provide intuitive applications
• Experience with high-performance computing systems or large-scale data analysis systems
• Must be comfortable working with mission-critical and sensitive systems, with a sense of urgency appropriate to the responsibilities
Company:
SpaceX is an aviation and aerospace company that designs, manufactures, and launches rockets and spacecraft. Founded in 2002, the company is headquartered in Hawthorne, California, USA, with a team of 1001-5000 employees. The company is currently Late Stage.
How to apply
To apply for this job you need to authorize on our website. If you don't have an account yet, please register.
Post a resumeSimilar jobs
Learning Experience Designer , Ring Global Learning and Quality

Part Time After School Educator (Fall) (Wiseburn)

Junior Software Engineer (Full Stack)
