LLM Engineer (SFT / RLHF / Post-Training)
Achieve Group · Doubaï
Job description
About the role
We are seeking an LLM Engineer to join a leading global tech platform focused on post‑training, reasoning, and alignment of large language models. The role offers the chance to work on cutting‑edge AI systems at massive scale, primarily in a remote setting.
Key responsibilities
- Develop and maintain LLM post‑training pipelines (SFT, DPO, RLHF).
- Improve model reasoning, alignment, and reliability.
- Train and optimise models ranging from 7B to 100B+ parameters.
- Build agent systems, tool‑use, and multi‑step reasoning workflows.
- Operate large‑scale GPU clusters and implement distributed training strategies.
Required profile
- Hands‑on experience with LLM fine‑tuning techniques such as SFT and LoRA.
- Familiarity with RLHF, DPO, PPO, or GRPO methods (any is a plus).
- Experience with models like LLaMA, Qwen, Mistral, or similar.
- Strong background in LLM, NLP, or applied machine‑learning systems.
Required skills
- Python
- PyTorch
- DeepSpeed
- LoRA
- SFT
- RLHF
- DPO
- PPO
- GRPO
- LLaMA
- Qwen
- Mistral
- GPU clusters
- Distributed training
What we offer
- Remote work arrangement.
- Opportunity to work on state‑of‑the‑art LLM systems at global scale.
- Fast‑moving, high‑impact environment.
- Exposure to next‑generation AI research in reasoning, agents, and alignment.
Questions fréquentes
Why are you reporting this job?
Apply in 30 seconds
Enter your email to apply. An account will be created automatically.
By continuing, you accept our terms of use.
Already have an account? Login
Published 5 hours ago
Expires 1 month from now
7 views · 0 applications
Boost your chances
Upload your CV — we will match you with relevant openings.
Analyzing your CV...
Achieve Group
Doubaï
Related job offers
-
Technology Project Manager (6‑month contract)
LanceSoft Middle East Doubaï -
Part‑time IT Executive (Healthcare Experience)
Quttainah Specialized Hospital - Dubai Doubaï -
Remote Rust Engineer – AI Data Training Contractor
YO IT Consulting Doubaï -
Bilingual Virtual Assistant (Arabic & English)
DataAnnotation Émirats arabes unis -
Junior Front-End Developer
PULSEMEDIA (APAC) Ajman