SitLLM: Large Language Models for Sitting Posture Health Understanding via Pressure Sensor Data

Abstract

Poor sitting posture is a critical yet often overlooked factor contributing to long-term musculoskeletal disorders and physiological dysfunctions. Existing sitting posture monitoring systems, although leveraging visual, IMU, or pressure-based modalities, often suffer from coarse-grained recognition and lack the semantic expressiveness necessary for personalized feedback. In this paper, we propose SitLLM, a lightweight multimodal framework that integrates flexible pressure sensing with large language models (LLMs) to enable fine-grained posture understanding and personalized health-oriented response generation. SitLLM comprises three key components: (1) a Gaussian-Robust Sensor Embedding Module that partitions pressure maps into spatial patches and injects local noise perturbations for robust feature extraction; (2) a Prompt-Driven Cross-Modal Alignment Module that reprograms sensor embeddings into the LLM's semantic space via multi-head cross-attention using the pre-trained vocabulary embeddings; and (3) a Multi-Context Prompt Module that fuses feature-level, structure-level, statistical-level, and semantic-level contextual information to guide instruction comprehension.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…