nano-JEPA: Una propuesta para posibilitar la interpretación de video usando computadoras personales

Adrián Rostagno; Javier Iparraguirre; Joel Ermantraut; Guillermo R. Friedrich

Authors

Adrián Rostagno Universidad Tecnológica Nacional, Facultad Regional Bahía Blanca, Argentina.
Javier Iparraguirre Universidad Tecnológica Nacional, Facultad Regional Bahía Blanca, Argentina.
Joel Ermantraut Universidad Tecnológica Nacional, Facultad Regional Bahía Blanca, Argentina.
Guillermo R. Friedrich Universidad Tecnológica Nacional, Facultad Regional Bahía Blanca, Argentina.

Keywords:

Feature Prediction, Unsupervised Learning, Visual Representations, Video, JEPA

Abstract

V-JEPA is an artificial intelligence model whose objective is to understand and predict video content. Uses a self-supervised learning approach; It is pretrained on unlabeled data and then tailored to specific tasks. It learns by predicting missing or masked parts of a video, forcing the model to understand and develop a comprehensive view of the scene. It aims to develop artificial intelligence that learns in a similar way to humans, forming internal models of the world around them to adapt and complete tasks efficiently. However, their enormous computational demands, which often require powerful GPU clusters, limit accessibility for many researchers. Therefore, nano-JEPA, an adaptation of V-JEPA, is proposed to run on personal computers, even without GPU. The nano-dataset repository is also presented, which facilitates the creation of manageable subsets from large public video data sets. The goal is to enable greater participation and experimentation inresearch with models similar to V-JEPA. Reasonable performance of nano-JEPA could be observed in subsequent tasks, opening doors for further exploration and innovation.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

nano-JEPA: A Proposal to Enable the Video Understanding Using Personal Computers

Authors

Keywords:

Abstract

Downloads

Metrics

Downloads

Published

How to Cite

Conference Proceedings Volume

Section

License

Most read articles by the same author(s)

ISSN

ISSN : 2683-8818

Language

contador

des

Current Conference Proceedings Volume