Fyp25031: Generative AI for Researching Archaeology and Teaching
Project objective
Deliver a web-based Retrieval-Augmented Generation (RAG) assistant tailored for archaeology. It provides accurate, citable answers to classroom and research questions, supports user document uploads, and operates on a curated domain corpus (no base model retraining needed for new knowledge). All answers are grounded in retrieved evidence, with precise citations at the document, section, page, and sentence levels.
Project Background
Retrieval-Augmented Generation (RAG) is effective for reducing hallucinations and improving traceability in LLMs, but naive RAG struggles in specialized fields like archaeology. It may miss domain terminology/cross-lingual variants, rely on single similarity signals for inaccurate ranking, and lack answerability checks or sentence-level citations—issues amplified by archaeology’s heterogeneous sources (scanned PDFs, bilingual texts, inconsistent formatting) and knowledge constraints (period, region, site).
General LLMs also have hallucinations/over-generalization in archaeology coursework, while large-scale fine-tuning is costly. Building on Prof. Cobb’s prior archaeology RAG baseline (which needs improvements in domain understanding, evidence diversity, etc.), this project develops a strengthened RAG pipeline. It optimizes corpus construction, retrieval, ranking, and generation to fit archaeology, supporting accurate, citable answers for teaching (aligned with materials, academic integrity) and research (evidence coverage, strict attribution).
Project Schedule
| Phase | Dates | Goal |
| P0 — Kickoff & Environment | 10/01–10/07 | Repo and infrastructure ready |
| P1 — Corpus & Chunking | 10/01–10/31 | Heading/paragraph chunking and ingestion |
| P2 — Query Preprocessing & Routing | 11/01–11/15 | Normalization + rule-based routing |
| P3 — Initial Retrieval & Candidate Slimming | 11/16–12/10 | Stable candidate pool |
| P4 — Re-ranking & Fusion | 12/11–01/10 | Cross-Encoder ranking integrated |
| P5 — Generation (3-step + citations) | 01/11–02/05 | Three-step generation with strict citations |
| P6 — Evaluation & Tuning | 02/06–03/01 | Metrics & hyperparameter tuning |
| P7 — Hardening & Deliverables | 03/02–03/20 | Stabilization & artifacts |
| P8 — Final Polish & Submission | 03/21–03/30 | Final handoff |