TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Hysia: Serving DNN-Based Video-to-Retail Applications in C...

Hysia: Serving DNN-Based Video-to-Retail Applications in Cloud

Huaizheng Zhang, Yuanming Li, Qiming Ai, Yong Luo, Yonggang Wen, Yichao Jin, Nguyen Binh Duong Ta

2020-06-09Video RetrievalVideo-to-Shop
PaperPDFCodeCode(official)

Abstract

Combining \underline{v}ideo streaming and online \underline{r}etailing (V2R) has been a growing trend recently. In this paper, we provide practitioners and researchers in multimedia with a cloud-based platform named Hysia for easy development and deployment of V2R applications. The system consists of: 1) a back-end infrastructure providing optimized V2R related services including data engine, model repository, model serving and content matching; and 2) an application layer which enables rapid V2R application prototyping. Hysia addresses industry and academic needs in large-scale multimedia by: 1) seamlessly integrating state-of-the-art libraries including NVIDIA video SDK, Facebook faiss, and gRPC; 2) efficiently utilizing GPU computation; and 3) allowing developers to bind new models easily to meet the rapidly changing deep learning (DL) techniques. On top of that, we implement an orchestrator for further optimizing DL model serving performance. Hysia has been released as an open source project on GitHub, and attracted considerable attention. We have published Hysia to DockerHub as an official image for seamless integration and deployment in current cloud environments.

Related Papers

Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval2025-06-11MAGMaR Shared Task System Description: Video Retrieval with OmniEmbed2025-06-11From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos2025-06-05Leveraging Auxiliary Information in Text-to-Video Retrieval: A Review2025-05-29Learning World Models for Interactive Video Generation2025-05-28LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts2025-05-20A Challenge to Build Neuro-Symbolic Video Agents2025-05-20Video-GPT via Next Clip Diffusion2025-05-18