34 24 14

Denis Kuznedelev

SpiridonSunRotator

https://github.com/Godofnothing

Godofnothing

AI & ML interests

Model compression, computer vision, NLP

Recent Activity

updated a model 20 days ago

daslab-testing/Llama-3.1-8B-Instruct-FPQuant-GPTQ-MXFP4-hadamard-scale_tuning

published a model 20 days ago

daslab-testing/Llama-3.1-8B-Instruct-FPQuant-GPTQ-MXFP4-hadamard-scale_tuning

updated a model 20 days ago

daslab-testing/Llama-3.1-8B-Instruct-FPQuant-GPTQ-MXFP4-identity-scale_tuning

View all activity

Organizations

upvoted 2 papers 2 months ago

The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

Paper • 2507.18553 • Published Jul 24 • 40

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17 • 122

upvoted a paper 3 months ago

MADrive: Memory-Augmented Driving Scene Modeling

Paper • 2506.21520 • Published Jun 26 • 36

upvoted 6 papers 4 months ago

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published Jun 7 • 71

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27 • 139

Unified Scaling Laws for Compressed Representations

Paper • 2506.01863 • Published Jun 2 • 19

Alchemist: Turning Public Text-to-Image Data into Generative Gold

Paper • 2505.19297 • Published May 25 • 83

Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models

Paper • 2505.16134 • Published May 22 • 18

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20 • 77

upvoted an article 5 months ago

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others •

May 12

• 535

upvoted 3 papers 6 months ago

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published Apr 8 • 110

One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

Paper • 2503.13358 • Published Mar 17 • 95

Scale-wise Distillation of Diffusion Models

Paper • 2503.16397 • Published Mar 20 • 41

upvoted an article 7 months ago

Article

Digest of models based on YandexGPT 5 Lite

•

Mar 19

• 32

upvoted 2 papers 7 months ago

RuCCoD: Towards Automated ICD Coding in Russian

Paper • 2502.21263 • Published Feb 28 • 132

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 192

upvoted a paper 8 months ago

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations

Paper • 2502.05003 • Published Feb 7 • 43

upvoted a paper 10 months ago

Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis

Paper • 2412.01819 • Published Dec 2, 2024 • 35

upvoted a paper 11 months ago

EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search

Paper • 2410.14649 • Published Oct 18, 2024 • 9

upvoted a paper about 1 year ago

Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization

Paper • 2409.00492 • Published Aug 31, 2024 • 11