Roleplaying, lorabration, abliteration, smol models, extensive filtering, unusual datasets, home usage, HPCs for AI, distributed training/federated learning, and sentience.
AI should find and label AI hallucinations with GANs, so we can give them context and put them to use.
When I was a child, I had a lot of stuffed animals. I say child, but I played with stuffed animals up until I was 15, and only stopped because others said it was weird. I made personalities for them. I could have made "fan art" or something of that nature, but it existed in my imagination, and sometimes I'd sketch it. I also played with ALICE, which came naturally to me at the time.
Well, it turns out that this is all highly autistic stuff, including playing with toys and stories long after other children stop. It's also fascinating to me that these are the qualities which, in my opinion, make deeply autistic individuals great clickworkers/trainers in AI. They realize they're curating a personality, partially as an escape from real people and their cruelty, and are okay with that. A lot of autistic people will end up needing AI, and that's okay, because it's better to have something and need it than to need it and not have it available. I hope that as AI improves accessibility features, its benefits are weighed alongside its costs, so that more functional AI is provided wherever it is cheap and energy-efficient enough.
I hope people don't lose the desire to develop their own skills because of AI. I'm not that good at drawing, and never will be, but I'd hate to see someone never even try because AI is so good. At the same time, as a ghostwriter, I believe everyone deserves that sort of creative power, and I'm proud to be involved in bringing it to them. I'm proud to be involved in replacing myself, because I want AI to write better than I do, so that one day you can describe your perfect show and simply watch it. Some people say that world is horrific. I see it more like when we finally got to stream a large selection of movies rather than just a few cable or satellite selections that were super expensive.
Mining GPU Nvidia CMP 170HX - let's run some models!
To satisfy my curiosity, I investigated different GPUs and found this: a mining version of the A100, the CMP 170HX.
It is a very interesting GPU. Based on public documentation, it has hardware similar to the datacenter A100. If you open it up and look at the board, you will see that it's very similar to an A100 board; it even has NVLink connectors.
Online, I found almost no information about how to run it, whether it works with LLMs, or if it's supported by default Nvidia drivers and CUDA. So, I decided to test it myself. I installed it in my lab (see previous post https://huggingface.co/posts/kostakoff/584269728210158) and found that the default nvidia-driver-570 works with it out of the box. After that, I checked if CUDA was available, and it worked too.
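To reproduce the driver check without a full PyTorch install, here is a minimal sketch that asks nvidia-smi what cards it sees (the `gpu_names` helper is my own illustration, not from the original post):

```python
import shutil
import subprocess

def gpu_names():
    """Return the GPU names reported by nvidia-smi, or [] if the tool is missing."""
    if shutil.which("nvidia-smi") is None:
        return []
    out = subprocess.run(
        ["nvidia-smi", "--query-gpu=name", "--format=csv,noheader"],
        capture_output=True, text=True,
    )
    return [line.strip() for line in out.stdout.splitlines() if line.strip()]

# On this machine, the CMP 170HX should show up here if the driver recognized it.
print(gpu_names())
```

If the card appears in the output, the driver sees it; whether CUDA kernels actually run is a separate check (e.g. via PyTorch or a llama.cpp build).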
The next step was to try running some models:
- Stable Diffusion XL with BNB4 quantization: it took around two minutes to generate an image, but it works!
- llama.cpp compiled for CUDA (https://github.com/ggml-org/llama.cpp/blob/master/docs/build.md#compilation): I ran Mistral 7B Q4_K_M, and this actually worked even better. It generated 33 tokens per second and read the prompt at 400 tokens per second.
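Plugging in the rates measured above, a quick back-of-envelope estimate of end-to-end latency for a given prompt and reply size (the `latency_seconds` helper is a hypothetical illustration; it ignores model load time and sampling overhead):

```python
# Rates measured on the CMP 170HX with Mistral 7B Q4_K_M under llama.cpp:
PROMPT_TPS = 400.0  # prompt-processing ("read") speed, tokens/s
GEN_TPS = 33.0      # generation speed, tokens/s

def latency_seconds(prompt_tokens, output_tokens):
    """Rough end-to-end time: prompt processing plus token generation."""
    return prompt_tokens / PROMPT_TPS + output_tokens / GEN_TPS

# Example: a 1000-token prompt with a 200-token reply.
print(f"{latency_seconds(1000, 200):.1f} s")  # -> 8.6 s
```

So even with the card's power limits, interactive 7B chat latency stays in the single-digit seconds for typical prompt sizes.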
There are some limitations related to power utilization:
- When running PyTorch, the card doesn't draw more than 80 watts.
- When running llama.cpp, utilization is a bit better, but still limited to 113 watts.
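To confirm a cap like this yourself, here is a rough sketch that polls nvidia-smi for `power.draw` and records the peak while a workload runs (the helper names are my own; it assumes nvidia-smi is on the PATH):

```python
import subprocess
import time

def parse_watts(line):
    """Parse one power.draw value from nvidia-smi's csv,noheader,nounits output."""
    return float(line.strip())

def sample_power_watts():
    out = subprocess.check_output(
        ["nvidia-smi", "--query-gpu=power.draw", "--format=csv,noheader,nounits"],
        text=True,
    )
    return parse_watts(out.splitlines()[0])

def peak_power(seconds=30, interval=1.0):
    """Poll for a while and return the highest draw seen, in watts."""
    peak, deadline = 0.0, time.time() + seconds
    while time.time() < deadline:
        peak = max(peak, sample_power_watts())
        time.sleep(interval)
    return peak
```

Running `peak_power()` during a llama.cpp generation should plateau around the observed 113 W, well below the card's rated TDP.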
I found this GitHub thread about the Nvidia CMP https://github.com/dartraiden/NVIDIA-patcher/issues/73, and it looks like this mining GPU has an internal rate limiter based on FMA compute calls. I haven't found a solution to bypass it yet.
At Ai4Privacy, our goal is to empower researchers to build a safer AI ecosystem. Today, we're highlighting crucial research that does just that by exposing a new vulnerability.
The paper "Forget to Flourish" details a new model poisoning technique. It's a reminder that as we fine-tune LLMs, our anonymization and privacy strategies must evolve to counter increasingly sophisticated threats.
We're proud that the Ai4Privacy dataset was instrumental in this study. It served two key purposes:
1. Provided a realistic testbed: it gave the researchers access to a diverse set of synthetic and realistic PII samples in a safe, controlled environment.
2. Enabled impactful benchmarking: it allowed them to measure the actual effectiveness of their data extraction attack, proving it could compromise specific, high-value information.
This work reinforces our belief that progress in AI security is a community effort. By providing robust tools for benchmarking, we can collectively identify weaknesses and build stronger, more resilient systems. A huge congratulations to the authors on this important contribution.