Hacker News
Ask HN: What do you think about Devin?
Anyone can build it with gpt4, then just swap to gpt4.5, it may cost 5k/month for 1 autonomous software engineer, but it will cost zero in a year, I will build Devin in a month if I this thread gets 10 likes
Comments URL: https://news.ycombinator.com/item?id=39688913
Points: 1
# Comments: 0
Ask HN: Where can I find practical comparative data regarding different LLMs?
I've been trying to keep up with the advances in the world of AI and LLMs. NLP was a world that I knew pretty well 7 years ago, when I knew most of the major NLP libraries, and their various strengths and weaknesses. However, nowadays, I'm having trouble finding good discussions about the real uses of the LLMs.
I have gone to Hugging Face, and the amount of data there is overwhelming, but it seems poorly organized:
https://huggingface.co
Does anyone know a secret that makes that site tractable? I've experimented with a few of the libraries posted there, but I can only sample a tiny fraction of what is there, and what I'm missing is some method for finding the useful stuff while disposing of the junk.
7 years ago I could tell you the strengths of weaknesses of the Google's Tensorflow or the Stanford NLP library. But where do I go to get good comparative information now, about the strengths and weaknesses of the various libraries that interact with the new LLM tools?
I'm looking to answer practical questions, that I can use in my own work with AI startups.
For an example of a question, for which I cannot find an answer, I am aware of a startup that has developed a chat client that, the startup says, can entirely replace a company's customer support team. Among the claims made by the startup is that when their chat client makes a mistake, it can be easily adjusted so it won't make that mistake any more. I am curious, what approaches are the engineers at that startup probably using to fix mistakes? If I search Hugging Face for ways to fix factual errors in LLMs then I see some libraries, but I've no idea what is considered good or bad.
So I asked the Hacker News community, how are you keeping up with advances around LLMs and associated tools?
Also, every LLM seems to have an embedded finite state machine that remembers the state of the current conversation, so where can I go to learn about the strengths and weaknesses of those finite state machines? How would I go about adjusting them?
Comments URL: https://news.ycombinator.com/item?id=39688911
Points: 2
# Comments: 0
First Cancer TIL Gene Therapy Gets FDA Approval for Advanced Melanoma
Article URL: https://www.cancer.gov/news-events/cancer-currents-blog/2024/fda-amtagvi-til-therapy-melanoma
Comments URL: https://news.ycombinator.com/item?id=39688900
Points: 2
# Comments: 0
CloudTrail events with detailed descriptions, MITRE ATT&CK insights
Article URL: https://traildiscover.cloud/
Comments URL: https://news.ycombinator.com/item?id=39688886
Points: 1
# Comments: 0
Techstars' $80M partnership with J.P. Morgan is on the rocks, employees say
Article URL: https://techcrunch.com/2024/03/08/techstars-80-million-partnership-with-j-p-morgan-is-on-the-rocks-employees-say/
Comments URL: https://news.ycombinator.com/item?id=39688878
Points: 2
# Comments: 0
Show HN: Writ.ly – Easy online Markdown editor
I absolutely love tldraw; it's a fascinating tool I use every day. And Bear Editor? Can't imagine my life without it. So, I set out to blend the best of both worlds as a toy project :3
I've also modularized the editor (https://github.com/writly/writly) as a React Component. Feel free to give it a try!
Comments URL: https://news.ycombinator.com/item?id=39688869
Points: 3
# Comments: 0
Who Owns the Moon? The Race for Lunar Real Estate Is an Ethical Nightmare
Article URL: https://www.inverse.com/science/moon-real-estate-ownership-ethics
Comments URL: https://news.ycombinator.com/item?id=39688865
Points: 1
# Comments: 0
How much do 155 mm artillery rounds cost now? And how many are fired in Ukraine?
Article URL: https://www.technology.org/2023/01/05/how-much-do-155-mm-artillery-rounds-cost-now-and-how-many-are-fired-in-ukraine/
Comments URL: https://news.ycombinator.com/item?id=39688831
Points: 4
# Comments: 1
Before Machine Learning Vol 2 – Calculus
Article URL: https://www.mldepot.co.uk
Comments URL: https://news.ycombinator.com/item?id=39688823
Points: 1
# Comments: 1
Give some new links a chance
Article URL: https://news.ycombinator.com/newest
Comments URL: https://news.ycombinator.com/item?id=39688802
Points: 29
# Comments: 2
Alpha-Omega Announces First Four Grants for Open Source Security of 2024
Article URL: https://alpha-omega.dev/blog/alpha-omega-announces-first-four-grants-of-2024-and-our-2024-okrs/
Comments URL: https://news.ycombinator.com/item?id=39688776
Points: 1
# Comments: 0
China's Best Self-Driving Car Platforms, Tested and Compared
Article URL: https://www.wired.com/story/chinas-best-self-driving-car-platforms-tested-and-compared-xpeng-nio-li-auto/
Comments URL: https://news.ycombinator.com/item?id=39688735
Points: 3
# Comments: 1
Tembo CLI: Infrastructure as code for the Postgres ecosystem
Article URL: https://tembo.io/blog/tembo-cli
Comments URL: https://news.ycombinator.com/item?id=39688667
Points: 1
# Comments: 0
Tenant-aware Serverless Postgres to build SaaS
Article URL: https://www.thenile.dev/
Comments URL: https://news.ycombinator.com/item?id=39688664
Points: 1
# Comments: 0
Motorola's newest budget phones look surprisingly good
Article URL: https://www.theverge.com/2024/3/12/24097606/motorola-moto-g-5g-power-price-screen-battery
Comments URL: https://news.ycombinator.com/item?id=39688654
Points: 1
# Comments: 0
Frugivore
Article URL: https://en.wikipedia.org/wiki/Frugivore
Comments URL: https://news.ycombinator.com/item?id=39688630
Points: 1
# Comments: 0