The following language models are currently available on the Lehmus AI platform. The list below shows the typical use cases of each model as well as the most important supported features.
Gemma 4 31B
Use case: general-purpose, analysis, reasoning, coding
- google/gemma-4-31b-it
- model id: jwwqblcgkizlhxjjbkcp
- enable-auto-tool-choice
- reasoning-parser=gemma4
- tool-call-parser=gemma4
- max-model-len=131072
Gemma 4 26B MoE
Use case: long materials, summarization, analysis, coding
- google/gemma-4-26B-A4B-it
- model id: wnofibsjlomanbprgbdp
- enable-auto-tool-choice
- reasoning-parser=gemma4
- tool-call-parser=gemma4
- max-model-len=131072
GPT OSS 120B
Use case: analysis, problem-solving, agents, tools
- openai/gpt-oss-120b
- model id: azsydsttjnlbfjbgqnwd
- enable-auto-tool-choice
- reasoning-parser=openai_gptoss
- tool-call-parser=openai
- kv-cache-dtype=fp8
Qwen3.6-35B-A3B
Use case: coding, agents, reasoning, tool calls
- qwen/qwen3.6-35B-A3B
- model id: ctraxdkjuwfrlivzrhgt
- enable-auto-tool-choice
- reasoning-parser=qwen3
- tool-call-parser=qwen3_coder
- max-model-len=131072
Qwen3.6-27B
Use case: general-purpose, conversation, analysis, coding
- qwen/qwen3.6-27B
- model id: hggelmtxxpzxwqucbjha
- enable-auto-tool-choice
- reasoning-parser=qwen3
- tool-call-parser=qwen3_coder
- max-model-len=262144
- speculative-config.num_speculative_tokens=2
- speculative-config.method=mtp
ICT Services aim to keep the model catalog as diverse as possible and to add new models whenever feasible. However, capacity is limited, so not all models can be offered. Instead, the selection is developed to support as many use cases as possible.
Is the language model you want not available?
Please first check whether you could use one of the existing language models for your purpose. You can request a new language model for the platform using the following form: requesting a new language model. After you submit a request for a new language model, our team will evaluate it based on the following criteria:
- Hardware compatibility: Assessment of VRAM requirements in relation to the available resources
- Data protection: We ensure that the model can be run safely in our environment without known side effects