📊 Статистика дайджестов

Всего дайджестов: 34022 Добавлено сегодня: 82

Последнее обновление: сегодня

📄 GazeTrack: High-Precision Eye Tracking Based on Regularization and Spatial Computing

2025-12-02

Авторы:

Xiaoyin Yang

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Eye tracking has become increasingly important in virtual and augmented reality applications; however, the current gaze accuracy falls short of meeting the requirements for spatial computing. We designed a gaze collection framework and utilized high-precision equipment to gather the first precise benchmark dataset, GazeTrack, encompassing diverse ethnicities, ages, and visual acuity conditions for pupil localization and gaze tracking. We propose a novel shape error regularization method to const...

ID: 2511.22607v1 cs.CV, cs.AI, cs.HC, cs.LG

arXiv PDF

📄 Words into World: A Task-Adaptive Agent for Language-Guided Spatial Retrieval in AR

2025-12-02

Авторы:

Lixing Guo, Tobias Höllerer

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Traditional augmented reality (AR) systems predominantly rely on fixed class detectors or fiducial markers, limiting their ability to interpret complex, open-vocabulary natural language queries. We present a modular AR agent system that integrates multimodal large language models (MLLMs) with grounded vision models to enable relational reasoning in space and language-conditioned spatial retrieval in physical environments. Our adaptive task agent coordinates MLLMs and coordinate-aware perception ...

ID: 2512.00294v1 cs.CV, cs.AI, cs.HC

arXiv PDF

📄 IndEgo: A Dataset of Industrial Scenarios and Collaborative Work for Egocentric Assistants

2025-11-26

Авторы:

Vivek Chavan, Yasmina Imgrund, Tung Dao, Sanwantri Bai, Bosong Wang, Ze Lu, Oliver Heimann, Jörg Krüger

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

We introduce IndEgo, a multimodal egocentric and exocentric dataset addressing common industrial tasks, including assembly/disassembly, logistics and organisation, inspection and repair, woodworking, and others. The dataset contains 3,460 egocentric recordings (approximately 197 hours), along with 1,092 exocentric recordings (approximately 97 hours). A key focus of the dataset is collaborative work, where two workers jointly perform cognitively and physically intensive tasks. The egocentric reco...

ID: 2511.19684v1 cs.CV, cs.AI, cs.HC, cs.RO

arXiv PDF

📄 Real-Time Drivers' Drowsiness Detection and Analysis through Deep Learning

2025-11-18

Авторы:

ANK Zaman, Prosenjit Chatterjee, Rajat Sharma

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

A long road trip is fun for drivers. However, a long drive for days can be tedious for a driver to accommodate stringent deadlines to reach distant destinations. Such a scenario forces drivers to drive extra miles, utilizing extra hours daily without sufficient rest and breaks. Once a driver undergoes such a scenario, it occasionally triggers drowsiness during driving. Drowsiness in driving can be life-threatening to any individual and can affect other drivers' safety; therefore, a real-time det...

ID: 2511.12438v1 cs.CV, cs.AI, cs.HC, cs.LG

arXiv PDF

📄 Radar-APLANC: Unsupervised Radar-based Heartbeat Sensing via Augmented Pseudo-Label and Noise Contrast

2025-11-15

Авторы:

Ying Wang, Zhaodong Sun, Xu Cheng, Zuxian He, Xiaobai Li

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Frequency Modulated Continuous Wave (FMCW) radars can measure subtle chest wall oscillations to enable non-contact heartbeat sensing. However, traditional radar-based heartbeat sensing methods face performance degradation due to noise. Learning-based radar methods achieve better noise robustness but require costly labeled signals for supervised training. To overcome these limitations, we propose the first unsupervised framework for radar-based heartbeat sensing via Augmented Pseudo-Label and Noi...

ID: 2511.08071v1 cs.CV, cs.AI, cs.HC, eess.SP

arXiv PDF

📄 SASG-DA: Sparse-Aware Semantic-Guided Diffusion Augmentation For Myoelectric Gesture Recognition

2025-11-15

Авторы:

Chen Liu, Can Han, Weishi Xu, Yaqi Wang, Dahong Qian

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Surface electromyography (sEMG)-based gesture recognition plays a critical role in human-machine interaction (HMI), particularly for rehabilitation and prosthetic control. However, sEMG-based systems often suffer from the scarcity of informative training data, leading to overfitting and poor generalization in deep learning models. Data augmentation offers a promising approach to increasing the size and diversity of training data, where faithfulness and diversity are two critical factors to effec...

ID: 2511.08344v2 cs.CV, cs.AI, cs.HC

arXiv PDF

📄 SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation

2025-11-15

Авторы:

Hu Cui, Wenqiang Hua, Renjing Huang, Shurui Jia, Tessai Hayama

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Recently, the Mamba architecture based on State Space Models (SSMs) has gained attention in 3D human pose estimation due to its linear complexity and strong global modeling capability. However, existing SSM-based methods typically apply manually designed scan operations to flatten detected 2D pose sequences into purely temporal sequences, either locally or globally. This approach disrupts the inherent spatial structure of human poses and entangles spatial and temporal features, making it difficu...

ID: 2511.08872v1 cs.CV, cs.AI, cs.HC

arXiv PDF

📄 FineSkiing: A Fine-grained Benchmark for Skiing Action Quality Assessment

2025-11-15

Авторы:

Yongji Zhang, Siqi Li, Yue Gao, Yu Jiang

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Action Quality Assessment (AQA) aims to evaluate and score sports actions, which has attracted widespread interest in recent years. Existing AQA methods primarily predict scores based on features extracted from the entire video, resulting in limited interpretability and reliability. Meanwhile, existing AQA datasets also lack fine-grained annotations for action scores, especially for deduction items and sub-score annotations. In this paper, we construct the first AQA dataset containing fine-grain...

ID: 2511.10250v1 cs.CV, cs.AI, cs.HC

arXiv PDF

📄 Accurate online action and gesture recognition system using detectors and Deep SPD Siamese Networks

2025-11-11

Авторы:

Mohamed Sanim Akremi, Rim Slama, Hedi Tabia

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Online continuous motion recognition is a hot topic of research since it is more practical in real life application cases. Recently, Skeleton-based approaches have become increasingly popular, demonstrating the power of using such 3D temporal data. However, most of these works have focused on segment-based recognition and are not suitable for the online scenarios. In this paper, we propose an online recognition system for skeleton sequence streaming composed from two main components: a detector ...

ID: 2511.05250v1 cs.CV, cs.AI, cs.HC

arXiv PDF

📄 AI Assisted AR Assembly: Object Recognition and Computer Vision for Augmented Reality Assisted Assembly

2025-11-11

Авторы:

Alexander Htet Kyaw, Haotian Ma, Sasa Zivkovic, Jenny Sabin

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

We present an AI-assisted Augmented Reality assembly workflow that uses deep learning-based object recognition to identify different assembly components and display step-by-step instructions. For each assembly step, the system displays a bounding box around the corresponding components in the physical space, and where the component should be placed. By connecting assembly instructions with the real-time location of relevant components, the system eliminates the need for manual searching, sorting...

ID: 2511.05394v1 cs.CV, cs.AI, cs.HC, H.5.2; H.5.1; I.4.8; I.2.6

arXiv PDF

Показано 1 - 10 из 27 записей