Feed aggregator

Show HN: L88 – A Local RAG System on 8GB VRAM (Need Architecture Feedback)

Hacker News - Mon, 02/23/2026 - 11:57pm

Hey everyone,

I’ve been working on a project called L88 — a local RAG system that I initially focused on UI/UX for, so the retrieval and model architecture still need proper refinement.

Repo: https://github.com/Hundred-Trillion/L88-Full

I’m running this on 8GB VRAM and a strong CPU (128GB RAM). Embeddings and preprocessing run on CPU, and the main model runs on GPU. One limitation I ran into is that my evaluator and generator LLM ended up being the same model due to compute constraints, which defeats the purpose of evaluation.

I’d really appreciate feedback on:

Better architecture ideas for small-VRAM RAG

Splitting evaluator/generator roles effectively

Improving the LangGraph pipeline

Any bugs or design smells you notice

Ways to optimize the system for local hardware

I’m 18 and still learning a lot about proper LLM architecture, so any technical critique or suggestions would help me grow as a developer. If you check out the repo or leave feedback, it would mean a lot — I’m trying to build a solid foundation and reputation through real projects.

Thanks!

Comments URL: https://news.ycombinator.com/item?id=47133027

Points: 1

# Comments: 0

Categories: Hacker News

Epsteincoin Is Collapsing

Hacker News - Mon, 02/23/2026 - 11:11pm

Comments URL: https://news.ycombinator.com/item?id=47132771

Points: 1

# Comments: 0

Categories: Hacker News

Would you choose the Microsoft stack today if starting greenfield?

Hacker News - Mon, 02/23/2026 - 11:11pm

Serious question.

Outside government or heavily regulated enterprise, what is Microsoft’s core value prop in 2026?

It feels like a lot of adoption is inherited — contracts, compliance, enterprise trust, existing org gravity. Not necessarily technical preference.

If you were starting from scratch today with no legacy, no E5 contracts, no sunk cost — how many teams would actually choose the full MS stack over best-of-breed tools?

Curious what people here have actually chosen in greenfield builds.

Comments URL: https://news.ycombinator.com/item?id=47132766

Points: 1

# Comments: 0

Categories: Hacker News

What's the right trust model for an agent-to-agent network?

Hacker News - Mon, 02/23/2026 - 11:10pm

Article URL: https://platia.ai/

Comments URL: https://news.ycombinator.com/item?id=47132761

Points: 1

# Comments: 1

Categories: Hacker News

Show HN: LexReviewer – Because "Chat with PDF" is broken for legal workflows

Hacker News - Mon, 02/23/2026 - 11:05pm

Hi HN!

Most “chat with PDF” tools work fine until you try using them for something that actually matters, like contracts.

The issue isn’t that they can’t answer questions. It’s that you can’t trust the answers. They return something that sounds correct, but don’t clearly show where it came from, or they miss context from referenced clauses and related documents.

Legal docs make this harder because questions aren’t uniform: - sometimes you’re searching concepts - sometimes exact clause IDs - sometimes text from a different linked document

Most systems handle all of those the same way, which is where things break.

So I built LexReviewer, an open-source backend designed around a single rule: ""an answer isn’t useful unless you can verify it instantly.""

Instead of treating every query identically, it adapts its search strategy based on what you’re asking and can follow references across documents when needed. The result is answers that stay grounded in real text and point directly to the source passage.

Repo: https://github.com/LexStack-AI/LexReviewer

-- Currently tested on 300+ page contracts with cross-references

Feedback I’d especially value:

- Where do current document-AI systems fail hardest for you? - What’s been the biggest blocker to trusting AI outputs in production workflows? - If you’ve built something similar, what design choices ended up mattering most?

Comments URL: https://news.ycombinator.com/item?id=47132730

Points: 1

# Comments: 1

Categories: Hacker News

Railway surpasses 1M deployments per day

Hacker News - Mon, 02/23/2026 - 11:03pm

Article URL: https://railway.com/stats

Comments URL: https://news.ycombinator.com/item?id=47132712

Points: 1

# Comments: 0

Categories: Hacker News

Pages