Welcome to Inference
Inference is a distributed GPU cluster built on Solana, designed for Large Language Model (LLM) inference. It provides fast, scalable APIs with token-based payments for models such as DeepSeek V3 and Llama 3.3.

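As a rough illustration of what calling such a service can look like, the sketch below builds a chat-completion request body. The endpoint URL, model identifier, and field names are assumptions (many LLM inference providers expose an OpenAI-compatible schema), not the actual Inference API.

```python
import json

# Placeholder endpoint -- assumed, not from the Inference docs.
API_URL = "https://api.example-inference.net/v1/chat/completions"

def build_request(model: str, prompt: str, max_tokens: int = 256) -> dict:
    """Build an OpenAI-style chat-completion request body."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }

# Model name is illustrative; consult the service's model list.
payload = build_request("deepseek-v3", "Explain Solana in one sentence.")
print(json.dumps(payload, indent=2))
```

In an OpenAI-compatible setup, this payload would be sent as a POST request to the completions endpoint with an API key in the `Authorization` header.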