Very cool! Any plans to open source?
\n","updatedAt":"2025-02-10T22:13:02.723Z","author":{"_id":"62e54f0eae9d3f10acb95cb9","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/62e54f0eae9d3f10acb95cb9/VAyk05hqB3OZWXEZW-B0q.png","fullname":"mrfakename","name":"mrfakename","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":2913}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8397917151451111},"editors":["mrfakename"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/62e54f0eae9d3f10acb95cb9/VAyk05hqB3OZWXEZW-B0q.png"],"reactions":[{"reaction":"👀","users":["bloomedout","nashah25","HeartofSheep","Rico123","light70","majxxx","pranavred","matrunchyk","invictus-axl","Narayanan007","jreynolds","bachir56","febryards","YoussefKhezami","unknownmixorgacc","huangzilong528","ernestyalumni","onlinework","huggingandrew12","kevin-hunan-lee","marksmithlive","kazuki-nectar","AymanKing","DealayLomoi"],"count":24}],"isReport":false,"parentCommentId":"67a9903f8a919ad008ad7549"}},{"id":"67b5e974bc707d7ed385c582","author":{"_id":"67b5e93e8c245b2e5f2b99ce","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/xrg_KsO_yQxNhRTn4Ro9G.png","fullname":"ruryrt","name":"zloktor","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false},"createdAt":"2025-02-19T14:23:48.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"mathematician","html":"mathematician
\n","updatedAt":"2025-02-19T14:23:48.738Z","author":{"_id":"67b5e93e8c245b2e5f2b99ce","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/xrg_KsO_yQxNhRTn4Ro9G.png","fullname":"ruryrt","name":"zloktor","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9685593843460083},"editors":["zloktor"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/xrg_KsO_yQxNhRTn4Ro9G.png"],"reactions":[],"isReport":false,"parentCommentId":"67a9903f8a919ad008ad7549"}},{"id":"67b7859cf71f1b6489a256f5","author":{"_id":"65d22b6adf205f2d8c932d65","avatarUrl":"/avatars/bc167fa6d467b979ee2c6aa6f046e229.svg","fullname":"Maksim","name":"gryzly","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false},"createdAt":"2025-02-20T19:42:20.000Z","type":"comment","data":{"edited":true,"hidden":false,"latest":{"raw":"2","html":"2
\n","updatedAt":"2025-02-20T19:43:16.325Z","author":{"_id":"65d22b6adf205f2d8c932d65","avatarUrl":"/avatars/bc167fa6d467b979ee2c6aa6f046e229.svg","fullname":"Maksim","name":"gryzly","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false}},"numEdits":1,"identifiedLanguage":{"language":"en","probability":0.9772802591323853},"editors":["gryzly"],"editorAvatarUrls":["/avatars/bc167fa6d467b979ee2c6aa6f046e229.svg"],"reactions":[],"isReport":false,"parentCommentId":"67a9903f8a919ad008ad7549"}}]},{"id":"67aa4c9c394d031d6375bfa8","author":{"_id":"67513cb7affa6791663fece9","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/Cj-zf7a7R_C5E_tvVQTc9.png","fullname":"Suraj Singh Chauhan","name":"surajssc1232","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false},"createdAt":"2025-02-10T18:59:40.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Holy","html":"Holy
\n","updatedAt":"2025-02-10T18:59:40.468Z","author":{"_id":"67513cb7affa6791663fece9","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/Cj-zf7a7R_C5E_tvVQTc9.png","fullname":"Suraj Singh Chauhan","name":"surajssc1232","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7029705047607422},"editors":["surajssc1232"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/Cj-zf7a7R_C5E_tvVQTc9.png"],"reactions":[],"isReport":false}},{"id":"67aa69a4a6e6b3d852d2aecf","author":{"_id":"67818b1fa6b75c5dc3cf430c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/67818b1fa6b75c5dc3cf430c/5aA0gP8ZvIkMndNA7CqqE.png","fullname":"Ribbit Ribbit","name":"ribbitribbit365","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":1},"createdAt":"2025-02-10T21:03:32.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"We made a deep dive video for this paper: https://www.youtube.com/watch?v=mwXIWcOXu8g. \n\"Kamehameha! Transform text into video—just like that!\"\nhttps://cdn-uploads.huggingface.co/production/uploads/67818b1fa6b75c5dc3cf430c/5jtYNiO8WwVpnLu-izFUw.mp4\n","html":"We made a deep dive video for this paper: https://www.youtube.com/watch?v=mwXIWcOXu8g.
\"Kamehameha! Transform text into video—just like that!\"
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
\nThe following papers were recommended by the Semantic Scholar API
\n- \n
- IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models (2025) \n
- Generative Video Propagation (2024) \n
- Open-Sora: Democratizing Efficient Video Production for All (2024) \n
- Pushing the Boundaries of State Space Models for Image and Video Generation (2025) \n
- Efficient Scaling of Diffusion Transformers for Text-to-Image Generation (2024) \n
- SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner (2024) \n
- BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations (2025) \n
Please give a thumbs up to this comment if you found it helpful!
\nIf you want recommendations for any Paper on Hugging Face checkout this Space
\n You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: \n\n@librarian-bot\n\t recommend
Weights wen? 👀
\n","updatedAt":"2025-02-11T02:44:22.222Z","author":{"_id":"64c1c77c245c55a21c6f5a13","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64c1c77c245c55a21c6f5a13/d9zlSksf3TxWpBbb-r0fd.jpeg","fullname":"Reza Sayar","name":"Reza2kn","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":73}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.3637184500694275},"editors":["Reza2kn"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/64c1c77c245c55a21c6f5a13/d9zlSksf3TxWpBbb-r0fd.jpeg"],"reactions":[{"reaction":"🔥","users":["huggingandrew12","Pg4555978541","DealayLomoi"],"count":3}],"isReport":false}},{"id":"67ad7f67e50eff64158bfa7a","author":{"_id":"666817d5e74ad47b0b718fc9","avatarUrl":"/avatars/3a08c1288f46b0edfc17280124b289c1.svg","fullname":"Srinivasulu kethanaboina","name":"redfernstech","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":4},"createdAt":"2025-02-13T05:13:11.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Mahabharat war","html":"Mahabharat war
\n","updatedAt":"2025-02-13T05:13:11.581Z","author":{"_id":"666817d5e74ad47b0b718fc9","avatarUrl":"/avatars/3a08c1288f46b0edfc17280124b289c1.svg","fullname":"Srinivasulu kethanaboina","name":"redfernstech","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":4}},"numEdits":0,"identifiedLanguage":{"language":"de","probability":0.480182409286499},"editors":["redfernstech"],"editorAvatarUrls":["/avatars/3a08c1288f46b0edfc17280124b289c1.svg"],"reactions":[],"isReport":false}},{"id":"67adc8c7c382ca6dac7d1cca","author":{"_id":"679eb790575df6520d9ec766","avatarUrl":"/avatars/3dff0f39a4aa87815ebb401018f04beb.svg","fullname":"Febry Ardiansyah","name":"febryards","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false},"createdAt":"2025-02-13T10:26:15.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"LFG!","html":"LFG!
\n","updatedAt":"2025-02-13T10:26:15.191Z","author":{"_id":"679eb790575df6520d9ec766","avatarUrl":"/avatars/3dff0f39a4aa87815ebb401018f04beb.svg","fullname":"Febry Ardiansyah","name":"febryards","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8548787832260132},"editors":["febryards"],"editorAvatarUrls":["/avatars/3dff0f39a4aa87815ebb401018f04beb.svg"],"reactions":[],"isReport":false}},{"id":"67b471460b33f6729933499a","author":{"_id":"65f19d6646f98cc188a75a4b","avatarUrl":"/avatars/37fbe76a0536ebdce4ab51d823f13d95.svg","fullname":"tevg","name":"tevg","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false},"createdAt":"2025-02-18T11:38:46.000Z","type":"comment","data":{"edited":true,"hidden":false,"latest":{"raw":"\n\n","html":"","updatedAt":"2025-02-18T11:40:27.283Z","author":{"_id":"65f19d6646f98cc188a75a4b","avatarUrl":"/avatars/37fbe76a0536ebdce4ab51d823f13d95.svg","fullname":"tevg","name":"tevg","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false}},"numEdits":1,"identifiedLanguage":{"language":"en","probability":0.33324918150901794},"editors":["tevg"],"editorAvatarUrls":["/avatars/37fbe76a0536ebdce4ab51d823f13d95.svg"],"reactions":[],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2502.04896","authors":[{"_id":"67a983ea9b72585dd12587fb","user":{"_id":"6412a33900634c4fe9873652","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/6412a33900634c4fe9873652/Nmn_yRA1gGD2VO1YbSOYF.jpeg","isPro":false,"fullname":"Shoufa Chen","user":"ShoufaChen","type":"user"},"name":"Shoufa Chen","status":"claimed_verified","statusLastChangedAt":"2025-02-10T09:49:52.136Z","hidden":false},{"_id":"67a983ea9b72585dd12587fc","user":{"_id":"620f126891e167b068fa76f8","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/620f126891e167b068fa76f8/NaPyS5lFjgZYJZrWaf0OI.jpeg","isPro":false,"fullname":"ChongjianGE","user":"RhettGee","type":"user"},"name":"Chongjian Ge","status":"admin_assigned","statusLastChangedAt":"2025-02-10T15:54:30.233Z","hidden":false},{"_id":"67a983ea9b72585dd12587fd","name":"Yuqi Zhang","hidden":false},{"_id":"67a983ea9b72585dd12587fe","name":"Yida Zhang","hidden":false},{"_id":"67a983ea9b72585dd12587ff","user":{"_id":"656971db2f7ea4b5ac238169","avatarUrl":"/avatars/29eca045338f1b9a272c42cf10a62823.svg","isPro":false,"fullname":"Fengda Zhu","user":"zhufengdaaa","type":"user"},"name":"Fengda Zhu","status":"admin_assigned","statusLastChangedAt":"2025-02-10T15:55:24.292Z","hidden":false},{"_id":"67a983ea9b72585dd1258800","user":{"_id":"67b06737019b7825d9fb508e","avatarUrl":"/avatars/80502db1a7fba7398e08dacbf401f152.svg","isPro":false,"fullname":"Hanish","user":"Hannah12","type":"user"},"name":"Hao Yang","status":"claimed_verified","statusLastChangedAt":"2025-02-18T09:34:54.824Z","hidden":false},{"_id":"67a983ea9b72585dd1258801","name":"Hongxiang Hao","hidden":false},{"_id":"67a983ea9b72585dd1258802","name":"Hui Wu","hidden":false},{"_id":"67a983ea9b72585dd1258803","user":{"_id":"6673e67d65b9964067706db9","avatarUrl":"/avatars/45018a5fffa77643b7a6d476f6063151.svg","isPro":false,"fullname":"Zhichao Lai","user":"sgcc-chao","type":"user"},"name":"Zhichao Lai","status":"admin_assigned","statusLastChangedAt":"2025-02-10T15:53:38.146Z","hidden":false},{"_id":"67a983ea9b72585dd1258804","user":{"_id":"64832c6675779e269260e98e","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64832c6675779e269260e98e/r-d14egc7wRBBY7_pD9dr.jpeg","isPro":false,"fullname":"Yifei Hu","user":"yifeihu","type":"user"},"name":"Yifei Hu","status":"admin_assigned","statusLastChangedAt":"2025-02-10T15:53:30.624Z","hidden":false},{"_id":"67a983ea9b72585dd1258805","user":{"_id":"63f89398da440a47e9f6b782","avatarUrl":"/avatars/6e2b4994a59b38add1332cc07b0ff3de.svg","isPro":false,"fullname":"Ting-Che Lin","user":"dronchego","type":"user"},"name":"Ting-Che Lin","status":"admin_assigned","statusLastChangedAt":"2025-02-10T15:53:19.492Z","hidden":false},{"_id":"67a983ea9b72585dd1258806","user":{"_id":"6424ffce46d202ad3d918a67","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/6424ffce46d202ad3d918a67/gmYmOA072fP_5cJLc9Qs4.jpeg","isPro":false,"fullname":"Shilong Zhang","user":"shilongz","type":"user"},"name":"Shilong Zhang","status":"admin_assigned","statusLastChangedAt":"2025-02-10T15:53:12.376Z","hidden":false},{"_id":"67a983ea9b72585dd1258807","name":"Fu Li","hidden":false},{"_id":"67a983ea9b72585dd1258808","user":{"_id":"67aa537bdc097a969e614493","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/67aa537bdc097a969e614493/A3LR9rOsOO5V5F7nscSr6.jpeg","isPro":false,"fullname":"Chuan Li","user":"chuanrichardli","type":"user"},"name":"Chuan Li","status":"claimed_verified","statusLastChangedAt":"2025-02-11T10:00:44.165Z","hidden":false},{"_id":"67a983ea9b72585dd1258809","name":"Xing Wang","hidden":false},{"_id":"67a983ea9b72585dd125880a","name":"Yanghua Peng","hidden":false},{"_id":"67a983ea9b72585dd125880b","user":{"_id":"640dc9bf8512ec51d7f0ac1a","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/640dc9bf8512ec51d7f0ac1a/sT4rdEoQbzfW6D3xDVdqt.jpeg","isPro":false,"fullname":"peizesun","user":"peizesun","type":"user"},"name":"Peize Sun","status":"claimed_verified","statusLastChangedAt":"2025-04-20T15:04:28.848Z","hidden":false},{"_id":"67a983ea9b72585dd125880c","name":"Ping Luo","hidden":false},{"_id":"67a983ea9b72585dd125880d","user":{"_id":"6344dcb1cd37e44d9ed46508","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/6344dcb1cd37e44d9ed46508/J92UKSxKR3iziD2WJfih4.jpeg","isPro":false,"fullname":"Yi Jiang","user":"JiangYi","type":"user"},"name":"Yi Jiang","status":"claimed_verified","statusLastChangedAt":"2025-03-02T20:18:51.440Z","hidden":false},{"_id":"67a983ea9b72585dd125880e","user":{"_id":"661a80af3557013b638061d5","avatarUrl":"/avatars/4c551aeb223e257a5fc45b5b6c7ded49.svg","isPro":false,"fullname":"Zehuan Yuan","user":"sweetrabor","type":"user"},"name":"Zehuan Yuan","status":"admin_assigned","statusLastChangedAt":"2025-02-10T15:52:06.140Z","hidden":false},{"_id":"67a983ea9b72585dd125880f","name":"Bingyue Peng","hidden":false},{"_id":"67a983ea9b72585dd1258810","user":{"_id":"66dbf16d7ec0e5f42175dbcb","avatarUrl":"/avatars/d28477ac9f02b633300cd51dea78704f.svg","isPro":false,"fullname":"liuxiaobing","user":"xiaobinggg","type":"user"},"name":"Xiaobing Liu","status":"admin_assigned","statusLastChangedAt":"2025-02-10T15:51:52.195Z","hidden":false}],"publishedAt":"2025-02-07T13:03:55.000Z","submittedOnDailyAt":"2025-02-10T02:13:39.239Z","title":"Goku: Flow Based Video Generative Foundation Models","submittedOnDailyBy":{"_id":"60f1abe7544c2adfd699860c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1674929746905-60f1abe7544c2adfd699860c.jpeg","isPro":false,"fullname":"AK","user":"akhaliq","type":"user"},"summary":"This paper introduces Goku, a state-of-the-art family of joint\nimage-and-video generation models leveraging rectified flow Transformers to\nachieve industry-leading performance. We detail the foundational elements\nenabling high-quality visual generation, including the data curation pipeline,\nmodel architecture design, flow formulation, and advanced infrastructure for\nefficient and robust large-scale training. The Goku models demonstrate superior\nperformance in both qualitative and quantitative evaluations, setting new\nbenchmarks across major tasks. Specifically, Goku achieves 0.76 on GenEval and\n83.65 on DPG-Bench for text-to-image generation, and 84.85 on VBench for\ntext-to-video tasks. We believe that this work provides valuable insights and\npractical advancements for the research community in developing joint\nimage-and-video generation models.","upvotes":105,"discussionId":"67a983ee9b72585dd125890f","ai_summary":"Goku, a state-of-the-art family of joint image-and-video generation models using rectified flow Transformers, sets new benchmarks in text-to-image and text-to-video tasks.","ai_keywords":["rectified flow Transformers","image-and-video generation models","text-to-image generation","text-to-video tasks","GenEval","DPG-Bench","VBench"]},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[{"_id":"630412d57373aacccd88af95","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1670594087059-630412d57373aacccd88af95.jpeg","isPro":true,"fullname":"Yasunori Ozaki","user":"alfredplpl","type":"user"},{"_id":"634c48a63d11eaedd88c7c4b","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/634c48a63d11eaedd88c7c4b/R1yITk1vNVbqQ9yNpdagm.png","isPro":false,"fullname":"MrlolDev","user":"MrlolDev","type":"user"},{"_id":"6482836de4bbb1c2dd303297","avatarUrl":"/avatars/64377886fdfb22ef329c3a61b0390813.svg","isPro":false,"fullname":"JIN","user":"PikaJIN","type":"user"},{"_id":"631c386bc73939ffc0716a37","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1662793811119-noauth.jpeg","isPro":false,"fullname":"SeongWan Kim","user":"idgmatrix","type":"user"},{"_id":"619507e7b74b6c591f794340","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/619507e7b74b6c591f794340/JbPDoy6Ko1V1-6oJJwFV8.jpeg","isPro":false,"fullname":"Weiyun Wang","user":"Weiyun1025","type":"user"},{"_id":"66f612b934b8ac9ffa44f084","avatarUrl":"/avatars/6836c122e19c66c90f1673f28b30d7f0.svg","isPro":false,"fullname":"Tang","user":"tommysally","type":"user"},{"_id":"652b83b73b5997ed71a310f2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/652b83b73b5997ed71a310f2/ipCpdeHUp4-0OmRz5z8IW.png","isPro":false,"fullname":"Rui Zhao","user":"ruizhaocv","type":"user"},{"_id":"647f36a8454af0237bd49574","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/647f36a8454af0237bd49574/jshkqBUTY-GZL8As8y6Aq.jpeg","isPro":false,"fullname":"Florent Daudens","user":"fdaudens","type":"user"},{"_id":"61868ce808aae0b5499a2a95","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/61868ce808aae0b5499a2a95/F6BA0anbsoY_Z7M1JrwOe.jpeg","isPro":true,"fullname":"Sylvain Filoni","user":"fffiloni","type":"user"},{"_id":"63f0baf66309c84d5f4a2226","avatarUrl":"/avatars/a122f7d92441bd2feef7d4eda993fab7.svg","isPro":false,"fullname":"Meme155","user":"Meme145","type":"user"},{"_id":"646b43deb1202bc77c1024a4","avatarUrl":"/avatars/cf791574ab986bac274e7fbcf04e2a59.svg","isPro":false,"fullname":"hangyu guo","user":"Rosiness","type":"user"},{"_id":"64bd994ef8f28a19b0d0acbe","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64bd994ef8f28a19b0d0acbe/igR6xy19rpgl2uqeh3iPa.jpeg","isPro":false,"fullname":"Snehasish Barman","user":"sbarman25","type":"user"}],"acceptLanguages":["*"],"dailyPaperRank":2}">Abstract
Goku, a state-of-the-art family of joint image-and-video generation models using rectified flow Transformers, sets new benchmarks in text-to-image and text-to-video tasks.
This paper introduces Goku, a state-of-the-art family of joint image-and-video generation models leveraging rectified flow Transformers to achieve industry-leading performance. We detail the foundational elements enabling high-quality visual generation, including the data curation pipeline, model architecture design, flow formulation, and advanced infrastructure for efficient and robust large-scale training. The Goku models demonstrate superior performance in both qualitative and quantitative evaluations, setting new benchmarks across major tasks. Specifically, Goku achieves 0.76 on GenEval and 83.65 on DPG-Bench for text-to-image generation, and 84.85 on VBench for text-to-video tasks. We believe that this work provides valuable insights and practical advancements for the research community in developing joint image-and-video generation models.
Community
Holy
We made a deep dive video for this paper: https://www.youtube.com/watch?v=mwXIWcOXu8g.
"Kamehameha! Transform text into video—just like that!"
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models (2025)
- Generative Video Propagation (2024)
- Open-Sora: Democratizing Efficient Video Production for All (2024)
- Pushing the Boundaries of State Space Models for Image and Video Generation (2025)
- Efficient Scaling of Diffusion Transformers for Text-to-Image Generation (2024)
- SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner (2024)
- BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Weights wen? 👀
Mahabharat war
LFG!
Models citing this paper 0
No model linking this paper