[ref. g52890602] Language Data Scientist - Manila
Innodata Manila Full-time
Roles and Responsibilities
As a Language Data Scientist, your role involves managing, consulting, and engaging with customers on process improvements in LLM training data synthesis, validation, and annotation. Advise and support the BU Head on engaging with the customer to understand the upstream activities that would be performed using the service of Innodata Inc.
Responsibilities:
- Dive deep into existing workflows and processes to gather data and insights, make recommendations, and drive improvement through innovation and cross-functional collaboration with customers.
- Critically assess annotation tooling and workflows.
- Quantitatively analyse large datasets, perform statistical analysis, calculate metrics, and make recommendations to improve accuracy and performance.
Minimum Education Requirements and Skills:
- MA in (computational) Linguistics, Data Science, Computer Science (AI / ML / NLU), or a related scientific / quantitative field. PhD strongly preferred.
- Strong knowledge of data structures, algorithms, and data engineering principles.
- Experience with Natural Language Processing (NLP) techniques and tools, such as SpaCy, NLTK, or Hugging Face.
- Proficiency in Python to handle / transform large datasets (e.g. pre- and post-processing data), to perform quantitative analyses, and to visualize data.
- Possess excellent problem-solving skills, with the ability to think critically and creatively to develop innovative AI propositions.
- Model Fine-Tuning: Knowledge of Fine-tune pre-trained models to adapt them to specific tasks and datasets, improving their performance and relevance.
- Data Engineering and Pipelines: Deep understanding of data pipelines to support ML and NLP workflows, knowledge of efficient data collection, transformation, and storage.
- Continuous Improvement: Updated with the latest advancements in ML and NLP technologies.
Viventis Search AsiaManila
AVAILABLE ROLES FOR DATA SCIENTIST
1. Senior Data Scientist NLP & Vision
• 5+ years in industry; projects in NLP and vision
• NLP, computer vision, deep learning, ML models, PyTorch, TensorFlow
• Familiar with LLM orchestration whether custom...
Synapsewerx Pty LtdManila
Job Description
Description
Synapsewerx is seeking a highly skilled and experienced Senior Data Scientist / AI Agent Engineer withsignificant experience in building, training and tuning Modelsand a solid foundation in Generative AI Models...
InnodataQuezon City, 10 km from Manila
Roles and Responsibilities
As a Language Data Scientist, your role involves managing, consulting, and engaging with customers on process improvements in LLM training data synthesis, validation, and annotation. Advise and support the BU Head...
Best jobs you don't want to miss: