.

A Model for the Automatic Mixing of Multiple Audio and Video Clips

LAUR Repository

Show simple item record

dc.contributor.author Hedna, Mohamed Rissal
dc.contributor.author Djemmal, Younes
dc.contributor.author Mershad, Khaleel
dc.date.accessioned 2024-03-08T14:40:28Z
dc.date.available 2024-03-08T14:40:28Z
dc.date.copyright 2023 en_US
dc.date.issued 2023-12-06
dc.identifier.isbn 9798350315660 en_US
dc.identifier.uri http://hdl.handle.net/10725/15396
dc.description.abstract Short-form videos are becoming a trend in today’s social media. A large number of tools provide users with the capability to edit and mix video clips with audio files to create a short-form video. A significant number of such videos currently exist on platforms such as TikTok and Instagram that were created by manual cut and mix of video and audio clips. In this paper, we examine the relationship that exists between video scenes and audio clips in order to identify the best mixing strategy of multiple video and audio files. For this purpose, we extract the beats and downbeats of the audio files and augment them with the scene changes in the video files. Next, we use the data to train two LSTM and RNN models to determine the best places to cut the video based on the audio data. Our system enables users to automatically create high-quality short-form videos by simply specifying the video clips and the audio file that should be mixed with them. We conducted experiments on 200 short-form videos. Our results illustrate the efficiency of the proposed system in determining the optimal cut scenes based on the characteristics of the selected audio file. en_US
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.title A Model for the Automatic Mixing of Multiple Audio and Video Clips en_US
dc.type Conference Paper / Proceeding en_US
dc.author.school SAS en_US
dc.author.idnumber 202203388 en_US
dc.author.department Computer Science And Mathematics en_US
dc.publication.place Piscataway, N.J. en_US
dc.keywords Short-form video en_US
dc.keywords Automatic video editing en_US
dc.keywords Video and audio mixing en_US
dc.keywords LSTM en_US
dc.keywords RNN en_US
dc.description.bibliographiccitations Includes bibliographical references. en_US
dc.identifier.doi https://doi.org/10.1109/CW58918.2023.00025 en_US
dc.identifier.ctation Hedna, M. R., Djemmal, Y., & Mershad, K. (2023, October). A Model for the Automatic Mixing of Multiple Audio and Video Clips. In 2023 International Conference on Cyberworlds (CW) (pp. 110-117). IEEE. en_US
dc.author.email khaleel.mershad@lau.edu.lb en_US
dc.conference.date 03-05 October 2023 en_US
dc.conference.pages 110-117 en_US
dc.conference.place Sousse, Tunisia en_US
dc.conference.title 2023 International Conference on Cyberworlds (CW) en_US
dc.identifier.tou http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php en_US
dc.identifier.url https://ieeexplore.ieee.org/abstract/document/10337600 en_US
dc.orcid.id https://orcid.org/0000-0003-3786-5529 en_US
dc.publication.date 2023 en_US
dc.author.affiliation Lebanese American University en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search LAUR


Advanced Search

Browse

My Account