Data Infrastructure Engineer
Company: OpenAI
Location: San Francisco
Posted on: February 1, 2025
Job Description:
You'll join the team that's behind OpenAI's data infrastructure
that powers critical engineering, product, and alignment teams that
are core to the work we do at OpenAI. The systems we support
include our data warehouse, batch compute infrastructure, streaming
infrastructure, data orchestration system, data lake, vector
databases, critical integrations, and more.About the RoleThe
Applied Data Platform team designs, builds, and operates the
foundational data infrastructure that enables products and teams at
OpenAI.You are comfortable with work such as scaling Kubernetes
services, OLAP systems, debugging Kafka consumer lag, diagnosing
distributed kv store failures, and designing a system to retrieve
image vectors with low latency.You are well versed with
infrastructure tooling such as Terraform, have worked with
Kubernetes, and possess SRE skill sets.This role is based in San
Francisco, CA. We use a hybrid work model of 3 days in the office
per week and offer relocation assistance to new employees.In this
role, you will:
- Design, build, and maintain data infrastructure systems such as
distributed compute, data orchestration, distributed storage, and
streaming infrastructure while ensuring scalability, reliability,
and security.
- Ensure our data platform can scale reliably to the next several
orders of magnitude.
- Accelerate company productivity by empowering your fellow
engineers and teammates with excellent data tooling and systems,
providing a best-in-class experience.
- Bring new features and capabilities to the world by partnering
with product engineers, trust & safety, and other teams to build
the technical foundations.
- Participate in an on-call rotation to respond to critical
incidents as needed.You might thrive in this role if you:
- Have 4+ years in data infrastructure engineering OR 4+ years in
infrastructure engineering with a strong interest in data.
- Take pride in building and operating scalable, reliable, secure
systems.
- Are comfortable with ambiguity and rapid change.
- Have a voracious and intrinsic desire to learn and fill in
missing skills-and an equally strong talent for sharing learnings
clearly and concisely with others.Some of the technologies you'll
be working with include Apache Spark, Clickhouse, Python,
Terraform, Kafka, Azure EventHub, and Vector DBs.About OpenAIOpenAI
is an AI research and deployment company dedicated to ensuring that
general-purpose artificial intelligence benefits all of humanity.
We push the boundaries of the capabilities of AI systems and seek
to safely deploy them to the world through our products. AI is an
extremely powerful tool that must be created with safety and human
needs at its core, and to achieve our mission, we must encompass
and value the many different perspectives, voices, and experiences
that form the full spectrum of humanity.We are an equal opportunity
employer and do not discriminate on the basis of race, religion,
national origin, gender, sexual orientation, age, veteran status,
disability, or any other legally protected status.For US Based
Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we
will consider qualified applicants with arrest and conviction
records.We are committed to providing reasonable accommodations to
applicants with disabilities, and requests can be made via this
link.At OpenAI, we believe artificial intelligence has the
potential to help people solve immense global challenges, and we
want the upside of AI to be widely shared. Join us in shaping the
future of technology.
#J-18808-Ljbffr
Keywords: OpenAI, Santa Rosa , Data Infrastructure Engineer, Engineering , San Francisco, California
Didn't find what you're looking for? Search again!
Loading more jobs...