Gradio

Export Destination

Select one or more destinations for the compiled model.

Create a PR in the cache repository
Create new Neuron-optimized repository
Create a PR in a custom repository

Custom Cache Repository

Repository used to store and fetch compilation cache artifacts (default: aws-neuron/optimum-neuron-cache)

Model Type

Choose the type of model you want to export

transformers
diffusers (soon)

Task ("auto" infers the task from the model)

Logs

optimum-neuron version: 0.4.1

This Space lets you automatically export 🤗 Transformers models to an AWS Neuron-optimized format for Inferentia/Trainium acceleration.

Simply provide a model ID from the Hugging Face Hub, and choose your desired output.

✨ Key Features

🚀 Create a New Optimized Repo: Automatically converts your model and uploads it to a new repository under your username (e.g., your-username/model-name-neuron).
🔗 Link Back to Original: Creates a Pull Request on the original model's repository to add a link to your optimized version, making it easier for the community to discover.
🛠️ PR to a Custom Repo: For custom workflows, you can create a Pull Request to add the optimized files directly into an existing repository you own.
📦 Contribute to Cache: Contribute the generated compilation artifacts to a centralized cache repository (or your own private cache), so that models that have already been exported do not need to be recompiled.
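
The repository naming in the first feature above (your-username/model-name-neuron) can be sketched in a few lines of Python. This is only an illustration: the helper name `neuron_repo_id` is hypothetical, and the pattern is inferred from the example given here, so the Space itself may apply different rules.

```python
def neuron_repo_id(username: str, model_id: str) -> str:
    """Build a target repo ID for a Neuron-optimized copy of `model_id`.

    Assumption: the `<username>/<model-name>-neuron` pattern is inferred from
    the example on this page; it is not taken from the Space's source code.
    """
    if not username or not model_id:
        raise ValueError("username and model_id must be non-empty")
    # Hub model IDs are either "name" or "namespace/name"; keep only the name.
    model_name = model_id.rstrip("/").split("/")[-1]
    return f"{username}/{model_name}-neuron"


print(neuron_repo_id("your-username", "bert-base-uncased"))
# your-username/bert-base-uncased-neuron
```

Dropping the original namespace keeps the new repository under your own account, which matches the "under your username" wording above.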

⚙️ How to Use

Model ID: Enter the ID of the model you want to export (e.g., bert-base-uncased or stabilityai/stable-diffusion-xl-base-1.0) and choose the corresponding task.
Export Options: Select at least one option for where to save the exported model. You can provide your own cache repo ID or use the default (aws-neuron/optimum-neuron-cache).
Convert & Upload: Click the button and follow the logs to track progress!
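
The rules in the steps above (a model ID, at least one export destination, and a cache repository that falls back to the default) could be checked before submitting, roughly as follows. All class and field names here are hypothetical, and the requirement that a custom-repo PR needs a repo ID is an assumption based on the feature description, not a documented rule of the Space.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Optional, Set

# Default cache repository named on this page.
DEFAULT_CACHE_REPO = "aws-neuron/optimum-neuron-cache"


class Destination(Enum):
    # Values mirror the three checkbox labels shown in the form.
    CACHE_PR = "Create a PR in the cache repository"
    NEW_REPO = "Create new Neuron-optimized repository"
    CUSTOM_PR = "Create a PR in a custom repository"


@dataclass
class ExportRequest:
    """Hypothetical container for the form fields; not the Space's own code."""
    model_id: str
    task: str = "auto"
    destinations: Set[Destination] = field(default_factory=set)
    cache_repo: str = DEFAULT_CACHE_REPO
    custom_repo: Optional[str] = None

    def validate(self) -> None:
        if not self.model_id.strip():
            raise ValueError("A model ID is required")
        if not self.destinations:
            raise ValueError("Select at least one export destination")
        # Assumption: a PR to a custom repo needs that repo's ID.
        if Destination.CUSTOM_PR in self.destinations and not self.custom_repo:
            raise ValueError("A custom repository ID is required for a custom-repo PR")


request = ExportRequest(
    model_id="bert-base-uncased",
    destinations={Destination.NEW_REPO, Destination.CACHE_PR},
)
request.validate()  # raises ValueError if a rule is violated
print(request.cache_repo)
```

Failing early on an empty destination set avoids starting a compilation whose output would have nowhere to go.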

🎨 Task Categories Legend

Feature Extraction · NLP · Text Generation · Audio · Vision · Multimodal · Similarity

🤗 Transformers

Architecture	Supported Tasks
ALBERT	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
AST	feature-extraction audio-classification
BERT	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
BLOOM	text-generation
Beit	feature-extraction image-classification
CamemBERT	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
CLIP	feature-extraction image-classification
ConvBERT	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
ConvNext	feature-extraction image-classification
ConvNextV2	feature-extraction image-classification
CvT	feature-extraction image-classification
DeBERTa (INF2 only)	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
DeBERTa-v2 (INF2 only)	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
Deit	feature-extraction image-classification
DistilBERT	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
DonutSwin	feature-extraction
Dpt	feature-extraction
ELECTRA	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
ESM	feature-extraction fill-mask text-classification token-classification
FlauBERT	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
GPT2	text-generation
Hubert	feature-extraction automatic-speech-recognition audio-classification
Levit	feature-extraction image-classification
Llama, Llama 2, Llama 3	text-generation
Mistral	text-generation
Mixtral	text-generation
MobileBERT	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
MobileNetV2	feature-extraction image-classification semantic-segmentation
MobileViT	feature-extraction image-classification semantic-segmentation
ModernBERT	feature-extraction fill-mask text-classification token-classification
MPNet	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
OPT	text-generation
Phi	feature-extraction text-classification token-classification
RoBERTa	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
RoFormer	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
Swin	feature-extraction image-classification
T5	text2text-generation
UniSpeech	feature-extraction automatic-speech-recognition audio-classification
UniSpeech-SAT	feature-extraction automatic-speech-recognition audio-classification audio-frame-classification audio-xvector
ViT	feature-extraction image-classification
Wav2Vec2	feature-extraction automatic-speech-recognition audio-classification audio-frame-classification audio-xvector
WavLM	feature-extraction automatic-speech-recognition audio-classification audio-frame-classification audio-xvector
Whisper	automatic-speech-recognition
XLM	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
XLM-RoBERTa	feature-extraction fill-mask multiple-choice question-answering text-classification token-classification
Yolos	feature-extraction object-detection

🧨 Diffusers

Architecture	Supported Tasks
Stable Diffusion	text-to-image image-to-image inpaint
Stable Diffusion XL Base	text-to-image image-to-image inpaint
Stable Diffusion XL Refiner	image-to-image inpaint
SDXL Turbo	text-to-image image-to-image inpaint
LCM	text-to-image
PixArt-α	text-to-image
PixArt-Σ	text-to-image
Flux	text-to-image

🤖 Sentence Transformers

Architecture	Supported Tasks
Transformer	feature-extraction sentence-similarity
CLIP	feature-extraction zero-shot-image-classification

💡 Note: Some architectures may have specific requirements or limitations. DeBERTa models are only supported on INF2 instances.
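
As a rough illustration of how the tables and the note above might be used, the sketch below checks an architecture/task pair against a small excerpt of the 🤗 Transformers table and flags the INF2-only restriction. The function name, the dictionary layout, and the instance-family strings ("inf1", "inf2", "trn1") are assumptions made for this example; only the rows themselves come from this page.

```python
from typing import List

# Excerpt of the Transformers table on this page (not the full list).
SUPPORTED_TASKS = {
    "BERT": {"feature-extraction", "fill-mask", "multiple-choice",
             "question-answering", "text-classification", "token-classification"},
    "DeBERTa": {"feature-extraction", "fill-mask", "multiple-choice",
                "question-answering", "text-classification", "token-classification"},
    "GPT2": {"text-generation"},
    "ViT": {"feature-extraction", "image-classification"},
    "Whisper": {"automatic-speech-recognition"},
}

# Architectures marked "(INF2 only)" in the table.
INF2_ONLY = {"DeBERTa", "DeBERTa-v2"}


def export_problems(architecture: str, task: str, instance_family: str) -> List[str]:
    """Return a list of human-readable problems; an empty list means no issue found."""
    tasks = SUPPORTED_TASKS.get(architecture)
    if tasks is None:
        return [f"{architecture} is not in this excerpt of the table"]
    problems = []
    # "auto" defers task selection to the exporter, so it is not checked here.
    if task != "auto" and task not in tasks:
        problems.append(f"{architecture} does not list the task {task!r}")
    if architecture in INF2_ONLY and instance_family.lower() != "inf2":
        problems.append(f"{architecture} is only supported on INF2 instances")
    return problems


print(export_problems("BERT", "fill-mask", "inf1"))
print(export_problems("DeBERTa", "fill-mask", "inf1"))
```

Returning a list rather than raising on the first failure lets a caller report every problem at once.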