
Urgent! Senior AI Research Engineer, Model Inference (100% Remote) Job Opening In Medellín – Now Hiring Tether Operations Limited

Senior AI Research Engineer, Model Inference (100% Remote)



Job description

Join Tether and Shape the Future of Digital Finance

At Tether, we’re not just building products, we’re pioneering a global financial revolution.

Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains.

By harnessing the power of blockchain technology, Tether enables you to store, send, and receive digital tokens instantly, securely, and globally, all at a fraction of the cost.

Transparency is the bedrock of everything we do, ensuring trust in every transaction.

Innovate with Tether

Tether Finance: Our innovative product suite features the world’s most trusted stablecoin, USDT, relied upon by hundreds of millions worldwide, alongside pioneering digital asset tokenization services.

But that’s just the beginning:

Tether Power: Driving sustainable growth, our energy solutions optimize excess power for Bitcoin mining using eco-friendly practices in state-of-the-art, geo-diverse facilities.

Tether Data: Fueling breakthroughs in AI and peer-to-peer technology, we reduce infrastructure costs and enhance global communications with cutting-edge solutions like KEET, our flagship app that redefines secure and private data sharing.

Tether Education: Democratizing access to top-tier digital learning, we empower individuals to thrive in the digital and gig economies, driving global growth and opportunity.

Tether Evolution: At the intersection of technology and human potential, we are pushing the boundaries of what is possible, crafting a future where innovation and human capabilities merge in powerful, unprecedented ways.

Why Join Us?

Our team is a global talent powerhouse, working remotely from every corner of the world.

If you’re passionate about making a mark in the fintech space, this is your opportunity to collaborate with some of the brightest minds, pushing boundaries and setting new standards.

We’ve grown fast, stayed lean, and secured our place as a leader in the industry.

If you have excellent English communication skills and are ready to contribute to the most innovative platform on the planet, Tether is the place for you.

Are you ready to be part of the future?

About the job:

We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine-tuning, and GPU acceleration.

The engineer will extend the inference framework to support inference and fine-tuning for language models, with a strong focus on mobile and integrated GPU acceleration (Vulkan).

This role requires hands-on experience with quantization techniques, LoRA architectures, the Vulkan backend, and mobile GPU debugging.

You will play a critical role in pushing the boundaries of desktop and on-device inference and fine-tuning performance for next-generation SLM/LLMs.

Responsibilities:

  • Implement and optimize custom inference and fine-tuning kernels for small and large language models across multiple hardware backends.

  • Implement and optimize full and LoRA fine-tuning for small and large language models across multiple hardware backends.

  • Design and extend datatype and precision support (int, float, mixed precision, ternary QTypes, etc.).

  • Design, customize, and optimize Vulkan compute shaders for quantized operators and fine-tuning workflows.

  • Investigate and resolve GPU acceleration issues on Vulkan and integrated/mobile GPUs.

  • Architect and prepare support for advanced quantization techniques to improve efficiency and memory usage.

  • Debug and optimize GPU operators (e.g., int8, fp16, fp4, ternary).

  • Integrate and validate quantization workflows for training and inference.

  • Conduct evaluation and benchmarking (e.g., perplexity testing, fine-tuned adapter performance).

  • Conduct GPU testing across desktop and mobile devices.

  • Collaborate with research and engineering teams to prototype, benchmark, and scale new model optimization methods.

  • Deliver production-grade, efficient language model deployment for mobile and edge use cases.

  • Work closely with cross-functional teams to integrate optimized serving and inference frameworks into production pipelines designed for edge and on-device applications.

    Define clear success metrics such as improved real-world performance, low error rates, robust scalability, and optimal memory usage, and ensure continuous monitoring and iterative refinement for sustained improvements.



Requirements:

  • Proficiency in C++ and GPU kernel programming.

  • Proven expertise in GPU acceleration with the Vulkan framework.

  • Strong background in quantization and mixed-precision model optimization.

  • Experience and expertise in Vulkan compute shader development and customization.

  • Familiarity with LoRA fine-tuning and parameter-efficient training methods.

  • Ability to debug GPU-specific performance and stability issues on desktop and mobile devices.

  • Hands-on experience with mobile GPU acceleration and model inference.

  • Familiarity with large language model architectures (e.g., Qwen, Gemma, LLaMA, Falcon, etc.).

  • Experience implementing custom backward operators for fine-tuning.

  • Experience creating and curating custom datasets for style transfer and domain-specific fine-tuning.

  • Demonstrated ability to apply empirical research to overcome challenges in model

Important information for candidates
Recruitment scams have become increasingly common.

To protect yourself, please keep the following in mind when applying for roles:

  • Apply only through our official channels. We do not use third-party platforms or agencies for recruitment unless clearly stated.

    All open roles are listed on our official careers page: https://tether.recruitee.com/

  • Verify the recruiter’s identity. All our recruiters have verified LinkedIn profiles.

    If you’re unsure, you can confirm their identity by checking their profile or contacting us through our website.

  • Be cautious of unusual communication methods. We do not conduct interviews over WhatsApp, Telegram, or SMS.

    All communication is done through official company emails and platforms.

  • Double-check email addresses. All communication from us will come from emails ending in @ or @

  • We will never request payment or financial details. If someone asks for personal financial information or payment at any point during the hiring process, it is a scam.

    Please report it immediately.

When in doubt, feel free to reach out through our official website.


Required Skill Profession

Computer Occupations





    Unlock Your Senior AI Potential: Insight & Career Growth Guide


  • Real-time Senior AI Jobs Trends in Medellín, Colombia (Graphical Representation)

    Expertini's real-time analysis of job market trends for Senior AI roles in Medellín, Colombia is presented as a bar chart of the number of jobs available, with a trend line showing the change over time. The graph shows 56,716 jobs in Colombia and 554 jobs in Medellín.

  • Are You Looking for Senior AI Research Engineer, Model Inference (100% Remote) Job?

    Great news! Tether Operations Limited is currently hiring and seeking a Senior AI Research Engineer, Model Inference (100% Remote) to join their team. Feel free to download the job details.

    Wait no longer! Are you also interested in exploring similar jobs?

  • The Work Culture

    An organization's rules and standards set how people should be treated in the office and how different situations should be handled. The work culture at Tether Operations Limited adheres to the cultural norms as outlined by Expertini.

    The fundamental ethical values are:
    • 1. Independence
    • 2. Loyalty
    • 3. Impartiality
    • 4. Integrity
    • 5. Accountability
    • 6. Respect for human rights
    • 7. Obeying Colombian laws and regulations
  • What Is the Average Salary Range for Senior AI Research Engineer, Model Inference (100% Remote) Positions?

    The average salary range for a Senior AI Research Engineer, Model Inference (100% Remote) varies, but the pay scale is rated "Standard" in Medellín. Salary levels may vary depending on your industry, experience, and skills. It's essential to research and negotiate effectively. We advise reading the full job specification before proceeding with the application to understand the salary package.

  • What Are the Key Qualifications for Senior AI Research Engineer, Model Inference (100% Remote)?

    Key qualifications for Senior AI Research Engineer, Model Inference (100% Remote) typically include Computer Occupations and a list of qualifications and expertise as mentioned in the job specification. Be sure to check the specific job listing for detailed requirements and qualifications.

  • How Can I Improve My Chances of Getting Hired for Senior AI Research Engineer, Model Inference (100% Remote)?

    To improve your chances of getting hired for Senior AI Research Engineer, Model Inference (100% Remote), consider enhancing your skills. Check your CV/Résumé Score with our free Tool. We have an in-built Resume Scoring tool that gives you the matching score for each job based on your CV/Résumé once it is uploaded. This can help you align your CV/Résumé according to the job requirements and enhance your skills if needed.

  • Interview Tips for Senior AI Research Engineer, Model Inference (100% Remote) Job Success
    Tether Operations Limited interview tips for Senior AI Research Engineer, Model Inference (100% Remote)

    Here are some tips to help you prepare for and ace your job interview:

    Before the Interview:
    • Research: Learn about Tether Operations Limited's mission, values, products, and the specific job requirements, and get further information about other openings.
    • Practice: Prepare answers to common interview questions and rehearse using the STAR method (Situation, Task, Action, Result) to showcase your skills and experiences.
    • Dress Professionally: Choose attire appropriate for the company culture.
    • Prepare Questions: Show your interest by having thoughtful questions for the interviewer.
    • Plan Your Commute: Allow ample time to arrive on time and avoid feeling rushed.
    During the Interview:
    • Be Punctual: Arrive on time to demonstrate professionalism and respect.
    • Make a Great First Impression: Greet the interviewer with a handshake, smile, and eye contact.
    • Confidence and Enthusiasm: Project a positive attitude and show your genuine interest in the opportunity.
    • Answer Thoughtfully: Listen carefully, take a moment to formulate clear and concise responses. Highlight relevant skills and experiences using the STAR method.
    • Ask Prepared Questions: Demonstrate curiosity and engagement with the role and company.
    • Follow Up: Send a thank-you email to the interviewer within 24 hours.
    Additional Tips:
    • Be Yourself: Let your personality shine through while maintaining professionalism.
    • Be Honest: Don't exaggerate your skills or experience.
    • Be Positive: Focus on your strengths and accomplishments.
    • Body Language: Maintain good posture, avoid fidgeting, and make eye contact.
    • Turn Off Phone: Avoid distractions during the interview.
    Final Thought:

    To prepare for your Senior AI Research Engineer, Model Inference (100% Remote) interview at Tether Operations Limited, research the company, understand the job requirements, and practice common interview questions.

    Highlight your leadership skills, achievements, and strategic thinking abilities. Be prepared to discuss your experience with HR, including your approach to meeting targets as a team player. Additionally, review Tether Operations Limited's products or services and be prepared to discuss how you can contribute to their success.

    By following these tips, you can increase your chances of making a positive impression and landing the job!

  • How to Set Up Job Alerts for Senior AI Research Engineer, Model Inference (100% Remote) Positions

    Setting up job alerts for Senior AI Research Engineer, Model Inference (100% Remote) is easy with Colombia Jobs Expertini. Simply visit our job alerts page, enter your preferred job title and location, and choose how often you want to receive notifications. You'll get the latest job openings sent directly to your email for FREE!