📊 Статистика дайджестов

Всего дайджестов: 34022 Добавлено сегодня: 82

Последнее обновление: сегодня

📄 Using MLIR Transform to Design Sliced Convolution Algorithm

2025-11-26

Авторы:

Victor Ferrari, Marcio Pereira, Lucas Alvarenga, Gustavo Leite, Guido Araujo

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

This paper proposes SConvTransform, a Transform dialect extension that provides operations for optimizing 2D convolutions in MLIR. Its main operation, SConvOp, lowers Linalg convolutions into tiled and packed generic operations through a fully declarative transformation pipeline. The process is guided by a Convolution Slicing Analysis that determines tile sizes and data layout strategies based on input and filter shapes, as well as target architecture parameters. SConvOp handles edge cases by sp...

ID: 2511.18222v1 cs.CV, cs.LG, cs.PF

arXiv PDF

📄 A Study on Inference Latency for Vision Transformers on Mobile Devices

2025-10-31

Авторы:

Zhuojin Li, Marco Paolieri, Leana Golubchik

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Given the significant advances in machine learning techniques on mobile devices, particularly in the domain of computer vision, in this work we quantitatively study the performance characteristics of 190 real-world vision transformers (ViTs) on mobile devices. Through a comparison with 102 real-world convolutional neural networks (CNNs), we provide insights into the factors that influence the latency of ViT architectures on mobile devices. Based on these insights, we develop a dataset including ...

ID: 2510.25166v1 cs.CV, cs.LG, cs.PF

arXiv PDF

📄 CRAM: Large-scale Video Continual Learning with Bootstrapped Compression

2025-08-09

Авторы:

Shivani Mall, Joao F. Henriques

**Резюме** В статье предлагается метод CRAM (Continually Refreshed Amodal Memory) для решения проблемы внедрения нейронных сетей в среды с непрерывными потоками видеоданных. Одной из основных проблем в этой области является нехватка памяти для хранения больших объемов видеоданных, особенно при использовании методов ребуферизации. Авторы предлагают хранить не полноценные видео, а их компьютерно-визуальные коды (embeddings), что позволяет значительно снизить памятные требования. Однако проблема заключается в том, что обучение компрессора видео в режиме непрерывного обучения подвержено катастрофическому забыванию, что они решают путем обновления компрессора с помощью заархивированных версий старой модели. Разработанная схема была протестирована на больших наборах данных EpicKitchens-100 и Kinetics-700, где авторы доказали существенное превосходство CRAM по сравнению с предыдущими подходами, особенно при ограничениях на память.

Annotation:

Continual learning (CL) promises to allow neural networks to learn from continuous streams of inputs, instead of IID (independent and identically distributed) sampling, which requires random access to a full dataset. This would allow for much smaller storage requirements and self-sufficiency of deployed systems that cope with natural distribution shifts, similarly to biological learning. We focus on video CL employing a rehearsal-based approach, which reinforces past samples from a memory buffer...

ID: 2508.05001v1 cs.CV, cs.LG, cs.PF

arXiv PDF