Judge Jobs
1 - 15 of 186
Search Results - Judge Jobs
HDFC securities-Bangalore-
or equivalents.
• LLM-as-a-judge pipelines — including knowing the failure modes (judge bias, position bias, verbosity bias) and how to mitigate them.
• Regression testing for prompts, models, and tool chains. Catching silent quality drift between model versions...
Read more
Alternative Path-India-
skills. Ability to document findings, communicate status, and outline next steps & finding ways to improve the existing documentation.
• Ability to judge when issues should be escalated to L2 support
• Good in troubleshoot the issues and finding...
Read more
HDFC securities-Bangalore-
and adversarial prompts. Experience curating golden datasets and maintaining them as the product evolves.
Hands-on with evaluation frameworks: Langfuse, Promptfoo, DeepEval, RAGAS, OpenAI Evals, LM-Eval-Harness, or equivalents.
LLM-as-a-judge pipelines — including...
Read more
Accion Labs-Sāngli-
few-shot strategies, and judge-based evaluation
• Integrate and manage LLM routing via LiteLLM: model fallback, cost control, and per-route configuration
• Design agentic workflows using LangGraph: multi-step retrieval, tool use, and conditional...
Read more
Alternative Path-Mumbai-
findings, communicate status, and outline next steps & finding ways to improve the existing documentation.
• Ability to judge when issues should be escalated to L2 support
• Good in troubleshoot the issues and finding the root cause as per the defined SOP...
Read more
Accion Labs-Nagpur-
few-shot strategies, and judge-based evaluation
• Integrate and manage LLM routing via LiteLLM: model fallback, cost control, and per-route configuration
• Design agentic workflows using LangGraph: multi-step retrieval, tool use, and conditional...
Read more
KNORR-BREMSE TECHNOLOGY CENTER INDIA PRIVATE LIMITED-Coimbatore-
Implement sophisticated memory management (short-term state and long-term RAG) and reasoning strategies (ReAct, Reflection) to reduce hallucinations.●AgentOps & CI/CD: Develop and maintain automated 'LLM-as-a-Judge' evaluation suites within the CI/CD...
Read more
Accion Labs-Pune-
few-shot strategies, and judge-based evaluation
• Integrate and manage LLM routing via LiteLLM: model fallback, cost control, and per-route configuration
• Design agentic workflows using LangGraph: multi-step retrieval, tool use, and conditional...
Read more
KNORR-BREMSE TECHNOLOGY CENTER INDIA PRIVATE LIMITED-Madurai-
Implement sophisticated memory management (short-term state and long-term RAG) and reasoning strategies (ReAct, Reflection) to reduce hallucinations.●AgentOps & CI/CD: Develop and maintain automated 'LLM-as-a-Judge' evaluation suites within the CI/CD...
Read more
Accion Labs-Pune-jmmst.com-
few-shot strategies, and judge-based evaluation
• Integrate and manage LLM routing via LiteLLM: model fallback, cost control, and per-route configuration
• Design agentic workflows using LangGraph: multi-step retrieval, tool use, and conditional...
Read more
KNORR-BREMSE TECHNOLOGY CENTER INDIA PRIVATE LIMITED-Salem-
Implement sophisticated memory management (short-term state and long-term RAG) and reasoning strategies (ReAct, Reflection) to reduce hallucinations.●AgentOps & CI/CD: Develop and maintain automated 'LLM-as-a-Judge' evaluation suites within the CI/CD...
Read more
Alternative Path-Champhai-
skills. Ability to document findings, communicate status, and outline next steps & finding ways to improve the existing documentation.
• Ability to judge when issues should be escalated to L2 support
• Good in troubleshoot the issues and finding...
Read more
Alternative Path-Kolasib-
skills. Ability to document findings, communicate status, and outline next steps & finding ways to improve the existing documentation.
• Ability to judge when issues should be escalated to L2 support
• Good in troubleshoot the issues and finding...
Read more
Alternative Path-Serchhīp-
skills. Ability to document findings, communicate status, and outline next steps & finding ways to improve the existing documentation.
• Ability to judge when issues should be escalated to L2 support
• Good in troubleshoot the issues and finding...
Read more
Alternative Path-Mumbai-
findings, communicate status, and outline next steps & finding ways to improve the existing documentation.
• Ability to judge when issues should be escalated to L2 support
• Good in troubleshoot the issues and finding the root cause as per the defined SOP...
Read more
29 similar jobs: Ajmer, Pānīpat, Lucknow, Guwahati, Vizianagaram...
12345678910
Companies now hiring:
More jobs – Legal:
Don’t miss out on new job vacancies!
Create a job alert for: Judge
It's free, and you can cancel email updates at any time
12345678910