Tier 1 — Frontier us Menlo Park, USA

Meta AI

Open-Source LLM Entwickler (Llama)

Modelle
Llama 4Llama 3.3CodeLlama
Links
Tools (2)

Nougat

docprocessing

Visual Transformer OCR (Meta) fuer wissenschaftliche Dokumente (Mathe, Formeln).

Website

Meta Llama 4

models

Llama 4: Scout (10M Kontext!), Maverick, Behemoth. 200 Sprachen. Open Source.

Website
Letzte News & Updates (10)
14.05.2026 Artikel
STOP: Structured On-Policy Pruning of Long-Form Reasoning in Low-Data Regimes

arXiv:2605.13165v1 Announce Type: new Abstract: Long chain-of-thought (Long CoT) reasoning improves performance on mult...

14.05.2026 Artikel
A3 : an Analytical Low-Rank Approximation Framework for Attention

arXiv:2505.12942v4 Announce Type: replace Abstract: Large language models have demonstrated remarkable performance; how...

14.05.2026 Artikel
Pretraining large language models with MXFP4 on Native FP4 Hardware

arXiv:2605.09825v2 Announce Type: replace-cross Abstract: Why does full-pipeline FP4 training of large language models ...

14.05.2026 Artikel
Domain Adaptation of Large Language Models for Polymer-Composite Additive Manufacturing Using Retrieval-Augmented Generation and Fine-Tuning

arXiv:2605.12516v1 Announce Type: new Abstract: General-purpose large language models (LLMs) often struggle to generate...

14.05.2026 Artikel
ToolWeave: Structured Synthesis of Complex Multi-Turn Tool-Calling Dialogues

arXiv:2605.12521v1 Announce Type: new Abstract: Multi-turn tool calling is essential for LLMs to function as autonomous...

14.05.2026 Artikel
Children's English Reading Story Generation via Supervised Fine-Tuning of Compact LLMs with Controllable Difficulty and Safety

arXiv:2605.13709v1 Announce Type: new Abstract: Large Language Models (LLMs) are widely applied in educational practice...

14.05.2026 Artikel
Dense vs Sparse Pretraining at Tiny Scale: Active-Parameter vs Total-Parameter Matching

arXiv:2605.13769v1 Announce Type: new Abstract: We study dense and mixture-of-experts (MoE) transformers in a tiny-scal...

14.05.2026 Artikel
Grid-Orch: An LLM-Powered Orchestrator for Distribution Grid Simulation and Analytics

arXiv:2605.12728v1 Announce Type: cross Abstract: The power distribution engineering workforce faces a projected shorta...

14.05.2026 Artikel
High-Rate Quantized Matrix Multiplication II

arXiv:2605.13768v1 Announce Type: cross Abstract: This is the second part of the work investigating quantized matrix mu...

14.05.2026 Artikel
PEFT-Factory: Unified Parameter-Efficient Fine-Tuning of Autoregressive Large Language Models

arXiv:2512.02764v3 Announce Type: replace Abstract: Parameter-Efficient Fine-Tuning (PEFT) methods address the increasi...

← Alle Anbieter anzeigen