Alysson Guimarães's picture

Alysson Guimarães

k3ybladewielder

·

https://orcid.org/0000-0002-1035-8992

AI & ML interests

NLP, Commonsense Reasoning

Recent Activity

upvoted an article 5 days ago

Introducing the Synthetic Data Generator - Build Datasets with Natural Language

upvoted an article 26 days ago

Mixture of Experts (MoEs) in Transformers

liked a model about 1 month ago

Octavio-Santana/deberta-v3-base-prompt-injection-detection

View all activity

Organizations

None yet

upvoted an article 5 days ago

Article

Introducing the Synthetic Data Generator - Build Datasets with Natural Language

+4

Dec 16, 2024

•

155

upvoted an article 26 days ago

Article

Mixture of Experts (MoEs) in Transformers

+5

28 days ago

•

144

upvoted a collection about 2 months ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 5 items • Updated Jan 27 • 172

upvoted a paper 5 months ago

Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning

Paper • 2509.24372 • Published Sep 29, 2025 • 12

upvoted a paper 6 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 193

upvoted an article 7 months ago

Article

A Survey of Small Language Models in the Era of LLMs: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness

Jul 16, 2025

•

4

upvoted an article 8 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

+10

Aug 5, 2025

•

511

upvoted a collection 8 months ago

Sumarização Abtrativa em Português

5 items • Updated Jan 30, 2024 • 1

upvoted 3 papers 8 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

Lizard: An Efficient Linearization Framework for Large Language Models

Paper • 2507.09025 • Published Jul 11, 2025 • 19

Test-Time Scaling with Reflective Generative Model

Paper • 2507.01951 • Published Jul 2, 2025 • 108

upvoted 2 collections 9 months ago

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 9 items • Updated 13 days ago • 453

MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated Feb 13 • 118

upvoted an article 9 months ago

Article

Getting Started With Embeddings

Jun 23, 2022

•

103

upvoted 3 collections 9 months ago

Gemma 3 Release

28 items • Updated 13 days ago • 627

Qwen3

84 items • Updated Dec 31, 2025 • 1.73k

Qwen3-Embedding

6 items • Updated Dec 31, 2025 • 154

upvoted 2 articles about 1 year ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

+3

May 24, 2023

•

176

Article

Vision Language Models Explained

Apr 11, 2024

•

529

upvoted a collection about 1 year ago

Jamba 1.6

The AI21 Jamba family of models are hybrid SSM-Transformer foundation models, outperforming open model competitors on quality and speed. • 2 items • Updated Mar 6, 2025 • 18