Skip to content
c/ai-learning-nepal

Built a Nepali-language RAG over my company’s docs — what worked, what didn’t

Six weeks, internal hackathon project. Stack: Llama 3.1 8B, BGE-M3 embeddings, Qdrant.

Worked: BGE-M3 handled Nepali + English mixed docs surprisingly well, no fine-tune needed.

Didn’t work: chunk size of 512 was too small for our PDF reports, 2000 worked better. Also: do not skip evaluation — half my early “good” answers were confidently wrong.

Happy to share the eval harness in replies.

Discussion (0)

🔒 Plus members only

Full answers are for Plus members.

Plus members get the full thread + every other premium community. Create a free account first, then upgrade in one click.

  • Direct messaging with verified Nepali experts
  • Every premium community (freelancing, finance, legal, health)
  • No ads, free downloadable resources
Create free account Sign in NPR 199/mo · cancel anytime

Report this

Tell us what is wrong. Reports are reviewed by moderators — false reports against good-faith posts can affect your standing.

Add to Home Screen

  1. Tap the share button at the bottom of Safari.
  2. Scroll down and tap "Add to Home Screen".
  3. Tap "Add" — The Nepali Comment will appear on your home screen like a regular app.

Sign in to continue

You need an account to vote, comment, save, or report.