Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Eldar Kurtić's picture
23 5 2

Eldar Kurtić

ekurtic
rahulsinghal's profile picture markurtz's profile picture rgreenberg1's profile picture
·
  • _EldarKurtic
  • eldarkurtic
  • eldar-kurtić-77963b160

AI & ML interests

Efficient inference

Recent Activity

updated a model 6 days ago
RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-quantized.w4a16
published a model 6 days ago
RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-quantized.w4a16
updated a model 8 days ago
nm-testing/NVIDIA-Nemotron-Nano-9B-v2-quantized.w4a16
View all activity

Organizations

Neural Magic's profile picture  IST Austria Distributed Algorithms and Systems Lab's profile picture NM Testing's profile picture Red Hat AI's profile picture ISTA DASLab Testing Account's profile picture wut?'s profile picture

upvoted a collection about 1 month ago

Speculator Models

Collection
10 items • Updated Sep 19 • 3
upvoted a paper 12 months ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 51
upvoted a collection about 1 year ago

Llama-3.2 Quantization

Collection
Llama 3.2 models quantized by Neural Magic • 9 items • Updated Sep 26, 2024 • 9
upvoted a paper over 1 year ago

Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

Paper • 2405.03594 • Published May 6, 2024 • 7
upvoted a paper about 2 years ago

Sparse Finetuning for Inference Acceleration of Large Language Models

Paper • 2310.06927 • Published Oct 10, 2023 • 15
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs