NM Testing

company

AI & ML interests

None defined yet.

Recent Activity

nm-autobot updated a model 39 minutes ago

nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Static-Asym-e2e

nm-autobot updated a model 43 minutes ago

nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Asym-e2e

nm-autobot updated a model about 1 hour ago

nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-e2e

View all activity

nm-testing 's models 510

nm-testing/Meta-Llama-3-8B-Instruct-FP8-channel-output-activation

8B • Updated Mar 10, 2025 • 3

nm-testing/Llama-3.2-1B-W4A16-Transforms

4B • Updated Mar 7, 2025 • 4

nm-testing/Ministral-8B-Instruct-2410-FP8-dynamic

8B • Updated Mar 5, 2025 • 2

nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-asym

0.3B • Updated Mar 4, 2025 • 3

nm-testing/Phi-4-mini-instruct-quantized.w4a16.asymmetric

2B • Updated Mar 3, 2025 • 15

nm-testing/Qwen1.5-MoE-A2.7B-Chat-quantized.w4a16

14B • Updated Feb 24, 2025 • 62.2k • 1

nm-testing/Moonlight-16B-A3B.w4a16

3B • Updated Feb 24, 2025 • 2

nm-testing/output_llama7b_2of4_w4a16_channel-main

Updated Feb 19, 2025

nm-testing/output_llama7b_2of4_w4a16_channel-refac

Updated Feb 19, 2025

nm-testing/quantization_2of4_sparse_w4a16

Updated Feb 19, 2025

nm-testing/Meta-Llama-3-8B-Instruct-W4A16-G128

2B • Updated Feb 17, 2025 • 3

nm-testing/Meta-Llama-3-8B-Instruct-W4A16-G128-refac

2B • Updated Feb 17, 2025 • 2

nm-testing/Meta-Llama-3-8B-Instruct-FP8-Dynamic

8B • Updated Feb 17, 2025 • 2

nm-testing/Meta-Llama-3-8B-Instruct-FP8-Dynamic-refac

8B • Updated Feb 17, 2025 • 2

nm-testing/whisper-large-v3.w4a16

Automatic Speech Recognition • 0.3B • Updated Feb 14, 2025 • 3 • 2

nm-testing/whisper-large-v2.w4a16

0.3B • Updated Feb 14, 2025 • 2

nm-testing/DeepSeek-Coder-V2-Lite-Instruct-FP8

Text Generation • 16B • Updated Feb 13, 2025 • 1.1k

nm-testing/llama2.c-stories42M-gsm8k-stacked-uncompressed

58.2M • Updated Feb 12, 2025 • 10.3k

nm-testing/llama2.c-stories42M-gsm8k-stacked-compressed

48.6M • Updated Feb 12, 2025 • 808

nm-testing/llama2.c-stories42M-gsm8k-sparse-only-uncompressed

58.1M • Updated Feb 12, 2025 • 10.5k

nm-testing/llama2.c-stories42M-gsm8k-sparse-only-compressed

48.6M • Updated Feb 12, 2025 • 934

nm-testing/llama2.c-stories42M-gsm8k-quantized-only-uncompressed

58.2M • Updated Feb 12, 2025 • 12k

nm-testing/llama2.c-stories42M-gsm8k-quantized-only-compressed

58.1M • Updated Feb 12, 2025 • 2.86k

nm-testing/Meta-Llama-3-8B-Instruct-AttnQuantOnly

8B • Updated Feb 11, 2025 • 2

nm-testing/Meta-Llama-3-8B-FP8-AttnQuant-WeightQuant

8B • Updated Feb 11, 2025 • 2

nm-testing/Meta-Llama-3-8B-FP8-AttnQuant

8B • Updated Feb 11, 2025 • 2

nm-testing/pixtral-12b-FP8-dynamic-all

13B • Updated Feb 7, 2025 • 2

nm-testing/pixtral-12b-W4A16-G128

3B • Updated Feb 7, 2025 • 4

nm-testing/Pixtral-Large-Instruct-2411-hf

Image-Text-to-Text • 124B • Updated Feb 6, 2025 • 9

nm-testing/Qwen2-VL-2B-Instruct-Sparse-0.6

2B • Updated Feb 3, 2025 • 3