nano-JEPA: A Proposal to Enable the Video Understanding Using Personal Computers

Authors

  • Adrián Rostagno, Universidad Tecnológica Nacional, Facultad Regional Bahía Blanca, Argentina.
  • Javier Iparraguirre, Universidad Tecnológica Nacional, Facultad Regional Bahía Blanca, Argentina.
  • Joel Ermantraut, Universidad Tecnológica Nacional, Facultad Regional Bahía Blanca, Argentina.
  • Guillermo R. Friedrich, Universidad Tecnológica Nacional, Facultad Regional Bahía Blanca, Argentina.

Keywords:

Feature Prediction, Unsupervised Learning, Visual Representations, Video, JEPA

Abstract

V-JEPA is an artificial intelligence model designed to understand and predict video content. It uses a self-supervised learning approach: it is pretrained on unlabeled data and then adapted to specific tasks. It learns by predicting missing or masked parts of a video, which forces the model to develop a comprehensive understanding of the scene. The aim is to develop artificial intelligence that learns in a way similar to humans, forming internal models of the surrounding world in order to adapt and complete tasks efficiently. However, its enormous computational demands, which often require powerful GPU clusters, limit its accessibility for many researchers. Therefore, nano-JEPA, an adaptation of V-JEPA, is proposed to run on personal computers, even without a GPU. The nano-dataset repository is also presented, which facilitates the creation of manageable subsets of large public video datasets. The goal is to enable broader participation and experimentation in research with V-JEPA-like models. Reasonable performance of nano-JEPA was observed on downstream tasks, opening the door to further exploration and innovation.
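
The idea of carving a manageable subset out of a large public video dataset can be illustrated with a minimal Python sketch. This is not the nano-dataset repository's actual interface: the function name `make_subset`, its parameters, the class names, and the file paths below are all hypothetical, chosen only to show one plausible approach (keep a few classes, sample a fixed number of clips per class, and fix the random seed so the subset is reproducible).

```python
import random
from collections import defaultdict


def make_subset(samples, classes, per_class, seed=0):
    """Pick up to `per_class` videos for each label in `classes`.

    `samples` is an iterable of (video_path, label) pairs.
    Hypothetical helper for illustration; not part of nano-dataset.
    """
    rng = random.Random(seed)  # fixed seed so the same subset is drawn each run
    by_label = defaultdict(list)
    for path, label in samples:
        if label in classes:
            by_label[label].append(path)

    subset = []
    for label in sorted(by_label):
        paths = sorted(by_label[label])  # sort first so shuffling is deterministic
        rng.shuffle(paths)
        subset.extend((p, label) for p in paths[:per_class])
    return subset


# Hypothetical index of a large dataset: 3 classes, 100 clips each.
full_index = [
    (f"videos/{label}/{i:04d}.mp4", label)
    for label in ("archery", "bowling", "surfing")
    for i in range(100)
]

small = make_subset(full_index, classes={"archery", "surfing"}, per_class=5)
print(len(small))  # 10 (2 classes x 5 clips)
```

Limiting both the number of classes and the clips per class keeps the subset small enough to iterate over on a machine without a GPU, at the cost of covering less of the original data distribution.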

Published

2025-07-15

How to Cite

Rostagno, A., Iparraguirre, J., Ermantraut, J., & Friedrich, G. R. (2025). nano-JEPA: A Proposal to Enable the Video Understanding Using Personal Computers. AJEA (Proceedings of UTN Academic Conferences and Events), (AJEA 47). Retrieved from https://rtyc.utn.edu.ar/index.php/ajea/article/view/1875

Section

Proceedings - Information and Computer Systems