Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
anana_'s comments
login
anana_
38 days ago
|
parent
|
context
|
next
[–]
| on:
Something is afoot in the land of Qwen
I've had even better results using the dense 27B model -- less looping and churning on problems
androiddrew
38 days ago
|
parent
|
next
[–]
Which dense model are you referring to? The dense model isn’t finetuned for code instruction according to the model card.
anana_
38 days ago
|
root
|
parent
|
next
[–]
https://huggingface.co/Qwen/Qwen3.5-27B
I wasn't aware of that, which page mentions that?
zerebos
38 days ago
|
root
|
parent
|
next
[–]
Yeah the page you linked even shows the benchmarks in coding for this model, so I'd be curious where that claim comes from
anana_
52 days ago
|
parent
|
context
|
prev
[–]
| on:
Warren Buffett dumps $1.7B of Amazon stock
They own GEICO...
buildbot
52 days ago
|
parent
[–]
Oh; well that’s embarrassing haha
conorcleary
52 days ago
|
root
|
parent
[–]
Nah, you can bet there's a tax break in there somewhere
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: