Senior GenAI Machine Learning Engineer; Business Intelligence
Company: Databricks Inc.
Location: San Francisco
Posted on: February 1, 2025
Job Description:
At Databricks, we are obsessed with enabling data teams to solve
the world's toughest problems. We do this by building and running
the world's best data and AI infrastructure platform, so our
customers can focus on the high value challenges that are central
to their own missions. Founded in 2013 by the original creators of
Apache Spark, Databricks has grown from a tiny corner office in
Berkeley, California to a global organization with over 1000
employees. Thousands of organizations, from small to Fortune 100,
trust Databricks with their mission-critical workloads, making us
one of the fastest growing SaaS companies in the world.You'll work
with teams across Databricks to conduct foundational research into
the feasibility and effectiveness of solutions that help customers
analyze data using natural language, and then bring those solutions
into our products to make data analysis easier and more
approachable for all of our customers. More broadly, our teams work
on some of the hardest, most interesting problems facing the
business, ranging from designing large-scale distributed AI/ML
systems, to optimizing distributed GPU model serving to developing
novel modeling methodologies that scale to production use cases.The
impact you will have:
- Shape the direction of our applied ML areas and intelligence
features in our products, helping customers translate unstructured
text into structured code, queries and data.
- Drive the development and deployment of state-of-the-art AI
models and systems that directly impact the capabilities and
performance of Databricks' products and services.
- Architect and implement robust, scalable ML infrastructure,
including data storage, processing, and model serving components,
to support seamless integration of AI/ML models into production
environments.
- Develop novel data collection, fine-tuning, and pre-training
strategies that achieve optimal performance on specific tasks and
domains.
- Design and implement automated ML pipelines for data
preprocessing, feature engineering, model training, hyperparameter
tuning, and model evaluation, enabling rapid experimentation and
iteration.
- Implement advanced model compression and optimization
techniques to reduce the resource footprint of language models
while preserving their performance.
- Contribute to the broader AI community by publishing research,
presenting at conferences, and actively participating in
open-source projects, enhancing Databricks' reputation as an
industry leader.What we look for:
- PhD in Computer Science, strongly preferred, or a related field
or equivalent practical experience.
- 2-5 years of machine learning engineering experience in
high-velocity, high-growth companies. Alternatively, a strong
background in relevant ML research in academia will be considered
as an equivalent qualification.
- Experience developing AI/ML systems at scale in production or
in high-impact research environments.
- Strong track record of working with language modeling
technologies. This could include the following: Developing
generative and embedding techniques, modern model architectures,
fine tuning / pre-training datasets, and evaluation
benchmarks.
- Strong coding and software engineering skills, and familiarity
with software engineering principles around testing, code reviews
and deployment.
- Experience deploying and scaling language models in production;
deep understanding of the unique infrastructure challenges posed by
training and serving LLMs.
- Strong understanding of computer science fundamentals.
- Prior experience with Natural Language Processing and
transforming unstructured text into structured code, queries and
data is a plus.
- Contributions to well-used open-source projects.Please note we
are open to employees working from our Mountain View, CA office for
this position.Pay Range TransparencyDatabricks is committed to fair
and equitable compensation practices. The pay range(s) for this
role is listed below and represents base salary range for
non-commissionable roles or on-target earnings for commissionable
roles. Actual compensation packages are based on several factors
that are unique to each candidate, including but not limited to
job-related skills, depth of experience, relevant certifications
and training, and specific work location. Based on the factors
above, Databricks utilizes the full width of the range. The total
compensation package for this position may also include eligibility
for annual performance bonus, equity, and the benefits listed
above.Local Pay Range:$166,000 - $210,250 USDAbout
DatabricksDatabricks is the data and AI company. More than 10,000
organizations worldwide - including Comcast, Cond-- Nast,
Grammarly, and over 50% of the Fortune 500 - rely on the Databricks
Data Intelligence Platform to unify and democratize data, analytics
and AI. Databricks is headquartered in San Francisco, with offices
around the globe and was founded by the original creators of
Lakehouse, Apache Spark, Delta Lake and MLflow.BenefitsAt
Databricks, we strive to provide comprehensive benefits and perks
that meet the needs of all of our employees. For specific details
on the benefits offered in your region, please visit .Our
Commitment to Diversity and InclusionAt Databricks, we are
committed to fostering a diverse and inclusive culture where
everyone can excel. We take great care to ensure that our hiring
practices are inclusive and meet equal employment opportunity
standards. Individuals looking for employment at Databricks are
considered without regard to age, color, disability, ethnicity,
family or marital status, gender identity or expression, language,
national origin, physical and mental ability, political
affiliation, race, religion, sexual orientation, socio-economic
status, veteran status, and other protected
characteristics.ComplianceIf access to export-controlled technology
or source code is required for performance of job duties, it is
within Employer's discretion whether to apply for a U.S. government
license for such positions, and Employer may decline to proceed
with an applicant on this basis alone.
#J-18808-Ljbffr
Keywords: Databricks Inc., Santa Rosa , Senior GenAI Machine Learning Engineer; Business Intelligence, Engineering , San Francisco, California
Didn't find what you're looking for? Search again!
Loading more jobs...