LLMforPSG

Code repository for the master's thesis "Adapting Large Language Models for Structured Information Extraction from Sleep Study Reports".

This repository contains the main scripts used for preprocessing polysomnography (PSG) reports, constructing prompts, running local LLM inference, post-processing the extracted JSON outputs, and comparing experiment results.

Clinical report data, manually annotated reference files, generated model outputs, and Jupyter notebooks containing non-shareable information are not included for privacy reasons.

Contents

psg_preprocess.py: conversion and text extraction from historical Word reports.
psg_extractPL.py: local LLM inference pipeline.
prompts.py: prompt variants used in the experiments.
run_queue.py: experiment queue for model/prompt runs.
psg_post.py: post-processing and schema harmonisation.
compare_experiments.py: evaluation and comparison of experiment outputs.
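
To make the post-processing and comparison steps more concrete, the sketch below shows one way a raw model response could be turned into a harmonised record and scored against a reference annotation. It is an illustration only and is not taken from psg_post.py or compare_experiments.py: the function names, the alias table, and the field names (`ahi`, `sleep_efficiency`) are all hypothetical, and the actual scripts may work quite differently.

```python
import json
import re

# Hypothetical alias table (an assumption for this sketch, not the thesis schema):
# maps key spellings a model might emit to one canonical field name.
KEY_ALIASES = {
    "apnea_hypopnea_index": "ahi",
    "sleep_eff": "sleep_efficiency",
}


def extract_json(raw: str) -> dict:
    """Return the first JSON object embedded in a raw model response, or {}."""
    start = raw.find("{")
    if start == -1:
        return {}
    try:
        # raw_decode parses one JSON value and ignores any trailing text.
        obj, _ = json.JSONDecoder().raw_decode(raw[start:])
    except json.JSONDecodeError:
        return {}
    return obj if isinstance(obj, dict) else {}


def harmonise(record: dict) -> dict:
    """Normalise key spelling and convert numeric strings to floats."""
    out = {}
    for key, value in record.items():
        # Lower-case the key and collapse non-alphanumeric runs to "_".
        norm = re.sub(r"[^a-z0-9]+", "_", key.strip().lower()).strip("_")
        canonical = KEY_ALIASES.get(norm, norm)
        if isinstance(value, str):
            value = value.strip()
            # Accept both decimal comma and decimal point, e.g. "12,5" or "12.5".
            if re.fullmatch(r"-?\d+(?:[.,]\d+)?", value):
                value = float(value.replace(",", "."))
        out[canonical] = value
    return out


def field_accuracy(predicted: dict, reference: dict) -> float:
    """Fraction of reference fields whose value is reproduced exactly."""
    if not reference:
        return 0.0
    hits = sum(1 for k, v in reference.items() if predicted.get(k) == v)
    return hits / len(reference)


if __name__ == "__main__":
    raw = 'Result: {"AHI": "12,5", "Sleep Eff": 83} Done.'
    predicted = harmonise(extract_json(raw))
    reference = {"ahi": 12.5, "sleep_efficiency": 80}
    print(predicted)
    print(field_accuracy(predicted, reference))
```

Exact-match scoring per field is the simplest possible comparison; a real evaluation would likely need tolerances for numeric fields and separate handling of missing values.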

Note

The code is provided for transparency and reproducibility of the thesis methodology. It cannot be run end-to-end without access to the private clinical report archive, reference annotations, and local model environment.
