Not Applicable
Posted March 14, 2026
Job link
Thinking about this job
Responsibilities
Commitments
Responsibilities
- This is a hands-on, applied ML role where your work ships directly into the product.
- Embedding models for code and natural language, including training, fine-tuning, and evaluation
- Clustering algorithms for grouping semantically related data (e.g., agglomerative clustering, similarity scoring)
- Token-level classification models for intelligent text compression
- Retrieval-augmented generation (RAG) pipelines, including vector search and adaptive response strategies
- LLM integration for summarization, headline generation, and conversational interfaces
- Evaluation frameworks and metrics for ML pipeline quality Requirements
Commitments
Comfort working in a fast-paced startup environment with high autonomy
Based in or willing to relocate to New York City Nice to Have
We consider all qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, disability, veteran status, genetic information, or any other characteristic protected by applicable federal, state, or local law, including the New York City Human Rights Law.
Not Met Priorities
What still needs stronger evidence
Requirements
- Similarity-based matching systems with dynamic threshold tuning
- Evaluation frameworks and metrics for ML pipeline quality Requirements
- 2+ years of professional experience in applied machine learning or ML engineering
- Strong experience with embedding models and vector similarity search
- Hands-on experience building and deploying clustering or classification pipelines
- Familiarity with large language models and prompt engineering for production use cases
- Proficiency in Python; comfort working with ML frameworks (PyTorch, scikit-learn, or similar)
- Experience with vector databases (e.g., Qdrant, Pinecone, Weaviate, or pgvector)
- Comfort working in a fast-paced startup environment with high autonomy
- Based in or willing to relocate to New York City Nice to Have
- Experience applying ML to code understanding, code search, or developer tools
- Familiarity with Rust or willingness to work alongside a Rust-heavy codebase
- Experience with token-level or span-level NLP tasks (NER, compression, extraction)
- Background in information retrieval or search ranking systems
Preferred Skills
- Similarity-based matching systems with dynamic threshold tuning
- Evaluation frameworks and metrics for ML pipeline quality Requirements
- Proficiency in Python; comfort working with ML frameworks (PyTorch, scikit-learn, or similar)
- Experience with vector databases (e.g., Qdrant, Pinecone, Weaviate, or pgvector)
- Comfort working in a fast-paced startup environment with high autonomy
- Based in or willing to relocate to New York City Nice to Have
- Experience applying ML to code understanding, code search, or developer tools
- Familiarity with Rust or willingness to work alongside a Rust-heavy codebase
- Experience with token-level or span-level NLP tasks (NER, compression, extraction)
- Background in information retrieval or search ranking systems
About Merget Merget is an early-stage startup in New York City building next-generation developer infrastructure. We're a small, technical team working on hard problems at the intersection of version control, AI-assisted development, and developer experience. If you care deeply about building intelligent systems that solve real engineering problems and want to shape a product from the ground up, we'd love to hear from you. The Role We're looking for an ML Engineer to own the machine learning and semantic intelligence layer of our platform. You'll design and build the models and pipelines that make our tools smart—embedding code changes, clustering related work, compressing information for efficient processing, and powering natural language interfaces over structured data. This is a hands-on, applied ML role where your work ships directly into the product. What You'll Work On
Embedding models for code and natural language, including training, fine-tuning, and evaluation
Clustering algorithms for grouping semantically related data (e.g., agglomerative clustering, similarity scoring)
Token-level classification models for intelligent text compression
Retrieval-augmented generation (RAG) pipelines, including vector search and adaptive response strategies
LLM integration for summarization, headline generation, and conversational interfaces
Similarity-based matching systems with dynamic threshold tuning
Evaluation frameworks and metrics for ML pipeline quality Requirements
2+ years of professional experience in applied machine learning or ML engineering
Strong experience with embedding models and vector similarity search
Hands-on experience building and deploying clustering or classification pipelines
Familiarity with large language models and prompt engineering for production use cases
Proficiency in Python; comfort working with ML frameworks (PyTorch, scikit-learn, or similar)
Experience with vector databases (e.g., Qdrant, Pinecone, Weaviate, or pgvector)
Comfort working in a fast-paced startup environment with high autonomy
Based in or willing to relocate to New York City Nice to Have
Experience applying ML to code understanding, code search, or developer tools
Familiarity with Rust or willingness to work alongside a Rust-heavy codebase
Experience with token-level or span-level NLP tasks (NER, compression, extraction)
Background in information retrieval or search ranking systems
Contributions to open-source ML projects Salary: $ 100,000 - $180,000 (The posted salary range is provided in compliance with New York City's Pay Transparency Law, Local Law 32.) Merget is an equal opportunity employer. We consider all qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, disability, veteran status, genetic information, or any other characteristic protected by applicable federal, state, or local law, including the New York City Human Rights Law.
Embedding models for code and natural language, including training, fine-tuning, and evaluation
Clustering algorithms for grouping semantically related data (e.g., agglomerative clustering, similarity scoring)
Token-level classification models for intelligent text compression
Retrieval-augmented generation (RAG) pipelines, including vector search and adaptive response strategies
LLM integration for summarization, headline generation, and conversational interfaces
Similarity-based matching systems with dynamic threshold tuning
Evaluation frameworks and metrics for ML pipeline quality Requirements
2+ years of professional experience in applied machine learning or ML engineering
Strong experience with embedding models and vector similarity search
Hands-on experience building and deploying clustering or classification pipelines
Familiarity with large language models and prompt engineering for production use cases
Proficiency in Python; comfort working with ML frameworks (PyTorch, scikit-learn, or similar)
Experience with vector databases (e.g., Qdrant, Pinecone, Weaviate, or pgvector)
Comfort working in a fast-paced startup environment with high autonomy
Based in or willing to relocate to New York City Nice to Have
Experience applying ML to code understanding, code search, or developer tools
Familiarity with Rust or willingness to work alongside a Rust-heavy codebase
Experience with token-level or span-level NLP tasks (NER, compression, extraction)
Background in information retrieval or search ranking systems
Contributions to open-source ML projects Salary: $ 100,000 - $180,000 (The posted salary range is provided in compliance with New York City's Pay Transparency Law, Local Law 32.) Merget is an equal opportunity employer. We consider all qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, disability, veteran status, genetic information, or any other characteristic protected by applicable federal, state, or local law, including the New York City Human Rights Law.