Turing.com review by ex-entrepreneur

"You never have to apply for another job once you’re a Turing developer"

- Harsh, full-stack developer from India

Harsh, a former tech entrepreneur, shared his Turing.com review and talked about how the company bridges the gap in employment opportunities across the globe by providing developers with better quality of work, culture, compensation, and exposure.

Harsh sharing his experience working as a Turing Developer

Meet Harsh

Harsh is a full-stack developer based out of New Delhi, India. He has been a part of the Turing family for a year now.

Harsh has more than five years of professional experience in web-related technologies like JavaScript, Python, and Golang. He has deployed several applications on Docker and Kubernetes.

Life before Turing jobs

Talking about his career trajectory, Harsh notes: "I'd been working on my startup for four years, yet I felt I was missing something. I didn't know what. But I always knew that I wanted to learn how world-class tech companies scaled their businesses."

Drawing inspiration from this, Harsh began looking for relevant employment opportunities. It soon became apparent that there weren't enough high-quality jobs in his local area, especially for the technologies that interested him.

"I didn't want to relocate away from my family. I wanted decent pay that justified the amount of effort I was putting in," he shares.

And thus, Harsh began looking for jobs that met his criteria.

How did he learn about Turing US jobs?

Harsh stumbled on an ad from Turing on one of his job hunts.

"When I went to the website, I noticed that their platform was different from the other remote gig websites. It offered long-term opportunities to developers," he adds.

"I registered on the website and took a couple of very well-designed tests. It was apparent that Turing had created these tests to select the best of the developers. I was able to clear them thanks to my extensive experience," he smiles.

How has journey with Turing.com been so far?

"I work with top US firms as a part of the Turing family, and I do that without even having to leave my room. Turing bridges the gap in employment opportunities across the world by giving developers better quality of work, culture, exposure, and pay," Harsh states.

What's his take on Turing developers?

The Delhi-based engineer believes that the best part about being a developer with Turing is the freedom to choose the place and time of work. "Turing developers can be anywhere in the world and still be able to work," he adds.

But that's not all. Talking about how the organization provides the best opportunities to its developers, Harsh explains: "Turing has an incredible matching team. They work with developers to find their interests and then match them with the right company, which, I feel, is unique! On other platforms, developers have to compete against each other to get gigs."

What's the final verdict?

"This organization takes care of everything, right from professional growth, timely payments, communication tools, and feedback to even fostering a sense of community. If you, too, are looking for better opportunities and a better quality of life, join me today. Once a part of Turing, you never have to apply to another job," he concludes.

Interested in U.S. software jobs?

Apply to Turing today.

Apply now

Explore remote developer jobs

briefcase
AI Quality Analyst - Portuguese (Portugal)

About Turing:
Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in coding, reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L.

Role Overview:

As an AI Quality Analyst, you will evaluate a new personalization feature for Gemini. You will assess how well the model uses information from your past Gemini conversations, Gmail, Google Search, and YouTube activity to make responses more relevant and helpful. This role requires a unique blend of creativity and analytical rigor. You will actively design prompts from the perspective of your own personal experiences. You will then use your analytical skills to assess the quality of the model's personalized responses, evaluating dimensions like Grounding, Integration, and Helpfulness.


Key Qualifications

  • Portugueese Proficiency: Ability to read and write in Portuguese with a high degree of comp, as Portuguese is the focus language for this project.
  • Personal Account Usage: Willingness to use your primary personal Google account (not a testing account) and enable personal data sources for a genuine assessment.
  • Schedule Flexibility: Full-time availability in your local time zone is required.  We are staffing a global, 24-hour operations team.
  • Exceptional Analytical Thinking: Demonstrate ability to evaluate nuanced and ambiguous AI responses, specifically assessing personalization quality.
  • Creative Prompt Engineering: Experience in designing creative, multi-turn starting prompts based on personal context to thoroughly test the model's capabilities.
  • Strong Evaluation Acumen: Understanding of personalization concepts, including the ability to identify incorrect personalization, poor inferences, and forced connections.
  • Meticulous Attention to Detail: The ability to review Side-by-Side (SxS) model responses and spot subtle differences in naturalness and overnarrating.
  • Excellent Written Communication: Superior ability to write clear, concise, and structured rationales for model rankings, explicitly referencing specific turn numbers.
  • Feedback: Ability to provide constructive feedback and detailed annotations.
  • Communication: Excellent communication and collaboration skills.
  • Independence: Self-motivated and able to work independently in a remote setting.
  • Technical Setup: Desktop/Laptop set up with a good internet connection.


Description:

  • In this role, you will be part of a dynamic team focused on evaluating the quality of personalized AI interactions. Your day-to-day work will involve:
  • Designing and executing multi-turn conversational prompts (typically 1-5 turns) that require the AI to utilize your personal information and experiences.
  • Evaluating model responses based on your intent from the starting prompt, checking if the personalization was appropriately applied.
  • Analyzing responses for Grounding issues, ensuring claims about you are supported by evidence and not flawed inferences or hallucinations.
  • Assessing Integration quality to ensure personal data is woven naturally into the response without robotic "overnarrating".
  • Rigorously evaluating and stack-ranking two model responses side-by-side (SxS) to determine which is overall more helpful, easy to use, and enjoyable.
  • Writing clear, defensible rationales for your comparisons, explicitly referencing where issues or positive aspects occurred in the conversation.
  • Extracting and verifying "Debug Info" from the model to confirm that chat summaries and data sources were properly utilized.
  • Maintaining strict data hygiene by deleting evaluation conversations to prevent them from polluting your future chat history.


Education & Experience

  • BS/BA degree or equivalent experience in a relevant field (e.g., Policy, Law, Ethics, Linguistics, Journalism, Computer Science, or a related analytical field).
  • Experience in data annotation, AI quality evaluation, content moderation, or a related role is strongly preferred.

Offer Details:

  • Commitments Required: at least 4 hours per day and upto 40 hours per week with 4 hours of overlap with PST.
  • Engagement type: Contractor
  • Engagement Length: 3 months
  • Our offered rate for this project is $15 per hour.

Evaluation Process -

  • Shortlisted candidates will be sent a Job Interest Form.
  • After the profile review, an assessment will be shared, which must be completed within 24 hours.
  • Based on the assessment outcomes, shortlisted candidates will be contacted to discuss the pre‑onboarding requirements.
Software
10K+ employees
Domain-Specific Languages
briefcase
AI Engineer

About Turing


Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in coding, reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L


Role Overview


We are looking for an AI/ML Engineer specializing in LLM post-training and reinforcement learning workflows. The role focuses on fine-tuning open-weight models, building reward systems, and improving model performance through scalable training, evaluation, and data curation


What does day-to-day life look like?

  • Design and execute fine-tuning pipelines for open-weight models (Qwen, Llama, Mistral families) using SFT → DPO → GRPO progressions on tool-use and agentic data.
  • Implement and tune LoRA / QLoRA adapters for parameter-efficient fine-tuning; understand when full fine-tuning vs PEFT is the right call.
  • Build reward functions and verifiers for RL training  including programmatic verifiers, LLM-as-judge rubrics, and state-transition checks against gym environments.
  • Generate, curate, and filter RL tool-use training data: golden trajectories, preference pairs, on-policy rollouts, and rejection-sampled completions.
  • Run distributed training on multi-GPU setups; manage inference at scale with vLLM (including extended-context configurations via YaRN / RoPE scaling).
  • Diagnose failure modes: reward hacking, distribution collapse, KL blow-up, tool-selection errors vs state-transition errors, format drift.
  • Define and track evaluation metrics  pass@k, pass^k, trajectory-level scoring, rubric-based vs binary scoring  and own model-quality reporting against benchmarks.
  • Partner with annotation, eval, and client teams to translate data-quality signals into training improvements.

Requirements

  • 3+ years of hands-on ML engineering experience, with at least 1+ year specifically on LLM post-training.
  • Demonstrated production or research experience with at least three of: SFT, LoRA/QLoRA, DPO, PPO, GRPO, RLHF.
  • Strong PyTorch fundamentals; working familiarity with Hugging Face TRL, Accelerate, DeepSpeed or FSDP, and vLLM.
  • Experience designing reward signals or verifiers for RL training  not just running training scripts.
  • Solid grasp of tokenization, attention, chat templates, tool-calling formats (OpenAI/Anthropic-style), and common failure modes in agent training.
  • Comfort with Python, distributed training, GPU profiling, and reading research papers and turning them into working code.

Strongly Preferred:


  • Experience training tool-use or agentic models (function calling, multi-step tool selection, planner-executor patterns).
  • Experience with synthetic data generation pipelines and rejection sampling.
  • Familiarity with MCP, LangChain/LangGraph, or similar agent frameworks.
  • Exposure to evals at scale: building harnesses, designing rubrics, dealing with judge variance and reward hacking.
  • Cloud/infra: RunPod, AWS, GCP; container workflows; long-context inference tuning.


Perks of Freelancing With Turing

  • Work in a fully remote environment.
  • Opportunity to work on cutting-edge AI projects with leading LLM companies.

Offer Details

  • Commitments Required: 40 hours per week with overlap of 4 hours with PST. 
  • Engagement Type: Contractor assignment (no medical/paid leave)
  • Duration of contract : 2 months; [expected start date is next week]
  • Location: India, Pakistan, Bangladesh, Brazil

Evaluation Process

  • 2 rounds of Technical Interview (90 mins)
-
1-10 employees
PythonMachine Learning
sample card

Apply for the best jobs

View more openings

Work full-time at top U.S. companies

Create your profile, pass Turing Tests and get job offers as early as 2 weeks.