- Видео 77
- Просмотров 68 039
NewGenAI
Индия
Добавлен 5 янв 2024
🚀 Welcome to StableAIHub - Your Gateway to AI Innovation! 🤖✨ Dive into the forefront of artificial intelligence and explore the fascinating world of Stable Diffusion with us. Uncover the magic where stability meets creativity, as we unravel the secrets of generating stunning images from text prompts. Whether you're an AI enthusiast, a tech explorer, or a creative mind seeking inspiration, you're in the right place. Join our community, stay updated on the latest breakthroughs, and embark on a journey of discovery in the ever-evolving landscape of AI. Subscribe now and let's shape the future together! 🌐🔍 #StableDiffusion #AIInnovation #TechExploration
Can LTX-Video Create Stunning Text-to-Video on Low VRAM (6/8 GB)? Find Out Now!
LTX-Video
github.com/Lightricks/LTX-Video/
Installation guide
drive.google.com/file/d/18lEmS3tP1ZMeElYhEyctk8MhTv57yq27/view?usp=sharing
#AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #texttovideo #LTXVideo
0:00 Benchmark
0:09 Introduction
0:36 Installation on Windows
github.com/Lightricks/LTX-Video/
Installation guide
drive.google.com/file/d/18lEmS3tP1ZMeElYhEyctk8MhTv57yq27/view?usp=sharing
#AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #texttovideo #LTXVideo
0:00 Benchmark
0:09 Introduction
0:36 Installation on Windows
Просмотров: 766
Видео
Allegro Quantized: Text-to-Video Model Now Runs on 8GB VRAM!
Просмотров 1987 часов назад
Allegro huggingface.co/rhymes-ai/Allegro Installation guide drive.google.com/file/d/1mNE1S6VKQKOtkYHn_XqLkPVuts4HSQE4/view #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #texttovideo #allegro 0:00 Introduction 0:31 Benchmark 2:0...
Can NVlabs SANA Generate 4096x4096 Images on Just 8GB VRAM? Let’s Find Out!
Просмотров 38612 часов назад
NVlabs SANA github.com/NVlabs/Sana Installation guide drive.google.com/file/d/1R6K_-vRen5BL-PXijnO8UzYQK7FG0cHh/view #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #texttoimage #sana #NvlabsSana 0:00 Introduction 0:33 Benchmark ...
Flux on 8GB VRAM? Witness the Magic of Lightning-Fast Image Generation using Nunchaku / SVDQuant
Просмотров 481День назад
Nunchaku / SVDQuant github.com/mit-han-lab/nunchaku Installation guide drive.google.com/file/d/1qtr00-PusMrbdNz5mBs7bCh_THg5VufG/view?usp=sharing #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #texttoimage #fluxdev #fluxschnell ...
Pyramid Flow: Lightning-Fast Video Generation from Text or Images - Only 8 GB VRAM Needed! Windows
Просмотров 69021 день назад
Pyramid Flow github.com/jy0205/Pyramid-Flow Updated files drive.google.com/file/d/1S_eh_TadJ1If26DTcmYdBvJdHdkBmQmK/view?usp=drive_link #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #texttovideo #imagetovideo #pyramidflow #text...
OmniGen: Transforming Multi-Modal Prompts into Stunning Visuals on 8GB VRAM
Просмотров 2,1 тыс.28 дней назад
OmniGen github.com/newgenai79/OmniGen/ Installation guide drive.google.com/file/d/17mFxfAj3JH0Wfr-Ouf618bufiPl03eKN/view #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology 0:00 Introduction 0:50 Benchmark on 8GB VRAM 1:05 Installation
The Beginner's Guide to Creating Your Own Talking-Head / Lip sync videos using EchoMimic
Просмотров 630Месяц назад
Forge github.com/lllyasviel/stable-diffusion-webui-forge EchoMimic tutorial ruclips.net/video/WtHdvSSQlWo/видео.html Extract frames ffmpeg -i video.mp4 -vf fps=30 input\%d.png Combine frames after post-processing ffmpeg -framerate 30 -i %d.png -vcodec libx264 -crf 1 video.mp4 #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInA...
Ctrl-X: Revolutionizing Text-to-Image Control Without Guidance
Просмотров 159Месяц назад
Ctrl-X github.com/genforce/ctrl-x Installation guide drive.google.com/file/d/1KdxQkjWQaPvgBTS4YGBV3ewMUjL477E2/view?usp=drive_link #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #CtrlX #T2IGeneration #StructureControl #Appearanc...
CtrLoRA Explained: Next-Level Control for Your Text-to-Image Creations!
Просмотров 409Месяц назад
CtrLoRA github.com/xyfJASON/ctrlora Installation guide drive.google.com/file/d/14fwXYLkbEcd1FHjOOPxMunpIkCW9zDTK/view?usp=sharing #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #CtrLoRA #ImageGeneration #EfficientAI #Controllabl...
Meissonic: Lightning-Fast 1B T2I Model for Jaw-Dropping 1024x1024 Images on Consumer GPUs!
Просмотров 318Месяц назад
Meissonic github.com/viiika/Meissonic Installation guide drive.google.com/file/d/1qTiJm_4az_ud4rCKxM6xZFzTLkwDnFx6/view?usp=sharing Gradio WebUI drive.google.com/file/d/1cgFhMKpDicF-lUV8xzRDZMhemXQ49oEd/view?usp=sharing #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthro...
The BEST voice cloning app ever? Clone Any Voice with F5-TTS: The Most Accurate TTS Yet!
Просмотров 3,1 тыс.Месяц назад
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching github.com/SWivid/F5-TTS Fix NUMPY package version pip install force-reinstall -v "numpy 1.25.2" Quick installation guide 1. Clone and navigate inside the folder 2. Create virtual environment python -m venv venv 3. Activate virtual environment venv\scripts\activate 4. Install Wheel pip install wheel 5. Install require...
From Low to Pro: Frame Interpolation with REAL-Video-Enhancer on Windows
Просмотров 259Месяц назад
REAL-Video-Enhancer github.com/TNTwise/REAL-Video-Enhancer #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #VideoEnhancer #FrameInterpolation #Upscaling #REALVideoEnhancer #VideoEditing #RIFEESRGAN #AIUpscaling #AiVideoInterpolat...
Think 8GB VRAM Can't Handle Controllable AI Generation? Naaaaaah! Introducing ControlNeXT SVD
Просмотров 2,2 тыс.Месяц назад
ControlNeXT github.com/dvlab-research/ControlNeXt/ ControlNeXt-SVD-v2 for Low VRAM systems (atleast 8 GB VRAM ) 8 GB shared github.com/newgenai79/ControlNeXt-SVD-v2 #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #ControlNeXT #AI...
Makeine Magic: Create Reels & Shorts from Just a Text Prompt!
Просмотров 125Месяц назад
Makeine github.com/Kither12/Makeine Updated files for Windows drive.google.com/file/d/1hhqBADXnufZzbTfROl92dxv-6fDE9QSK/view?usp=sharing ImageMagick for Windows imagemagick.org/script/download.php#windows #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearc...
Deep Live Cam: Face Swaps for Live camera, Images, Videos, and Multiple Faces!
Просмотров 680Месяц назад
Deep-Live-Cam github.com/hacksider/Deep-Live-Cam Fix for transparent window github.com/hacksider/Deep-Live-Cam/issues/668 #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #DeepLiveCam #FaceSwap #RealTimeFaceSwap #ImageToVideo #Liv...
SadTalker: Audio-Driven Single Image Talking Face Animation on Windows
Просмотров 1 тыс.Месяц назад
SadTalker: Audio-Driven Single Image Talking Face Animation on Windows
OOTDiffusion: The Future of Virtual Try-ons with AI Fashion
Просмотров 8012 месяца назад
OOTDiffusion: The Future of Virtual Try-ons with AI Fashion
ResShift: Lightning-Fast Super-Resolution & Face Restoration
Просмотров 3072 месяца назад
ResShift: Lightning-Fast Super-Resolution & Face Restoration
Master Voice Cloning with CosyVoice: Multilingual AI for Realistic Speech Generation
Просмотров 8112 месяца назад
Master Voice Cloning with CosyVoice: Multilingual AI for Realistic Speech Generation
Unlock Emotions in Talking-head Videos with EDTalk
Просмотров 9983 месяца назад
Unlock Emotions in Talking-head Videos with EDTalk
AniTalker: Lightning-Fast Talking Head Animations with Unique Facial Motion Encoding
Просмотров 9493 месяца назад
AniTalker: Lightning-Fast Talking Head Animations with Unique Facial Motion Encoding
Ultimate Vocal Remover: Effortless Vocal Extraction with Deep Neural Networks
Просмотров 2053 месяца назад
Ultimate Vocal Remover: Effortless Vocal Extraction with Deep Neural Networks
Make Backgrounds Disappear: Quick and Easy Transparent Background Tool | Powered by InSPyReNet
Просмотров 2493 месяца назад
Make Backgrounds Disappear: Quick and Easy Transparent Background Tool | Powered by InSPyReNet
AICoverGen: Create Song Covers with RVC v2 AI Voices!
Просмотров 5284 месяца назад
AICoverGen: Create Song Covers with RVC v2 AI Voices!
EchoMimic Magic: Audio and Landmarks Bring Portraits to Life! The BEST talking head generation app.
Просмотров 2,6 тыс.4 месяца назад
EchoMimic Magic: Audio and Landmarks Bring Portraits to Life! The BEST talking head generation app.
How to Create Perfect Lipsync Videos with LipSick
Просмотров 5044 месяца назад
How to Create Perfect Lipsync Videos with LipSick
FSRT: AI-Powered Next-Gen Face Reenactment Technology
Просмотров 4664 месяца назад
FSRT: AI-Powered Next-Gen Face Reenactment Technology
LivePortrait: Create Hilarious Portrait Animations Effortlessly!
Просмотров 3,5 тыс.4 месяца назад
LivePortrait: Create Hilarious Portrait Animations Effortlessly!
MimicMotion: Revolutionizing Human Motion Videos
Просмотров 2,8 тыс.4 месяца назад
MimicMotion: Revolutionizing Human Motion Videos
Hallo: Breakthrough in Audio-Driven Portrait Animation
Просмотров 1,8 тыс.4 месяца назад
Hallo: Breakthrough in Audio-Driven Portrait Animation
Hey thanks for this, while I'm running it on my system I'm encountering an error: Traceback (most recent call last): File "/home/sr/sarvesh/videoGen/LTXVideo/inference.py", line 369, in <module> main() File "/home/sr/sarvesh/videoGen/LTXVideo/inference.py", line 231, in main vae = load_vae(vae_dir) File "/home/sr/sarvesh/videoGen/LTXVideo/inference.py", line 39, in load_vae vae_state_dict = safetensors.torch.load_file(vae_ckpt_path) File "/home/sr/anaconda3/envs/LTXVideo/lib/python3.10/site-packages/safetensors/torch.py", line 313, in load_file with safe_open(filename, framework="pt", device=device) as f: safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooLarge any workaround for this???
Thanks friend, I will try this, I hope it will run with 6 GB of GPU, I will tell you later if it was possible. Your channel is valuable, glad I found you
Thank you
Does it work on AMD?
Unfortunately, No.
can i run this on 1050 gtx 4gb VRAM?? with t5xxl_fp8_e4m3fn.safetensors text encoder and ltx-video-2b-v0.9.safetensors as checkpoint??
You will have to make code changes to load FP8 model.
OMFG this is fast! I wish I could run in comfy!
Yes you can run on Comfyi, there is a workflow on their github page github.com/Lightricks/LTX-Video
Can it run on comfyui ?
This is for standalone installation. You can find Comfyi workflow in their github repo.
Yes
Yes you can run on Comfyi, there is a workflow on their github page github.com/Lightricks/LTX-Video
hi there , how do do image to video with this , thanks
I have not tested if it is working. try 1. Copy the image in LTXVideo folder 2. In config.yaml update the image file name input_image_path: "" 3. Update prompt and other settings. Please let me know if it works. If not I will have a look
Tested, Verified I2V working.
You have a new subscriber, I'm glad to know this, I would like to know, which of these two tools do you consider to be more efficient and achieves better results? Allegro or Pyramid Flow?
Thank you. Just wait for another day or 2, new t2V tool is working on low VRAM. I am preparing guide.
@@StableAIHub Thanks friend, I will be attentive to that, since I'm interested in these tools requiring few resources, I have an RTX with only 6GB of ram, but I have been able to do things with some of these tools, at the moment with LivePortrait I have been able to make videos (at 12 fps), but generating videos in another way I have not tried it yet
@@Crisisdarkness LTX-Video made changes to make it work on 6 GB too provided you have atleast 16 Gb shared available. Checkout and let me know if it works for you
@@StableAIHub There is one thing that I don't understand, when you refer to having 16 GB shared, how can I do that? In the "performance" tab, I see shared GPU memory usage of 11 GB, could I increase it in some way?
@@Crisisdarkness If it is working than fine. That means enough memory is available to load the models. To increase shared memory your motherboard should support it. Search in google and see if you can find.
I came to find out about this great news late, I thought that to generate videos locally I had to have a monster GPU, this is great, I'll try it, it seems that this tool is efficient and could improve even more
thanks
Doesn't work with the “forge” version The only way I have found to use a VAE with the correct model is as follows: Give the VAE the same name as the model checkpoint.
Hi, i'm stuck - after i run python app.py and fetching 10 files 100% - i don't see the line: 127.0.0.1:port - please help - thanks
Nice as usual, keep going friend ❤
Thank you. If you choose to try please do post what VRAM and processing time.
I followed your instructions exactly but it will not work I even installed Anaconda to the latest version. ModuleNotFoundError: No module named 'gradio' ModuleNotFoundError: No module named 'spaces' ModuleNotFoundError: No module named 'torchvision' ModuleNotFoundError: No module named 'transformers' ModuleNotFoundError: No module named 'app' ModuleNotFoundError: No module named 'app.sana_pipeline'
I also faced this issue. Solved using Launch command prompt from within cloned repo activate Virtual environment conda activate sana pip install -e .
Please try the steps as mentioned by @trishul1979 and let me know if you are still facing issues
@@StableAIHub It does not work for me on Windows 11 The installation asked about a gated model gemini 2. and now when I followed every step correctly I am getting this error with a new latest Anaconda installation: "ImportError: DLL load failed while importing libtriton: A dynamic link library (DLL) initialization routine failed."
@@StableAIHub I have the same issue on win 11 : "ImportError: DLL load failed while importing libtriton: A dynamic link library (DLL) initialization routine failed."
@@playgnition3071 Surprising. I have been using these wheels for a long time now, works each time. I suggest install from original repository as they have removed triton dependency
Where is the bat file mentioned in your instructions?
It's in github. Once you follow the steps you will see it.
@@StableAIHub Thank you :)
@@ArtificialDevLabs u r welcome
Not run on colab t4 with 16gb viga
Sorry this tutorial is for Windows not for Collab
When will version 0.6b be released and how can I use version 1.6b in Colab without problems?
They will release in some time. I don't know about collab.
please i'm interested in your techniques for running notebooklm
Sorry I only know Windows installation.
Sana-0.6B?????? on colab t4
0.6B model is not released, only 1.6B is released. I only know Windows installation. You can check here github.com/NVlabs/Sana/issues/33
Not run on colab t4 how to run it on colab t4 with sana 0.6b
0.6B model is not released, only 1.6B is released. Please refer this github.com/NVlabs/Sana/issues/33
Running app_sana.py gives an module not found error. Of gradio, Then spaces Then torchvision Then app.sana pipeline. How to fix this ?
Did you activate conda activate sana? if still issue Launch command prompt from within cloned repo activate V.E. pip install -e .
In colab t4 not run
Please check here github.com/NVlabs/Sana/issues/33
Does this work with hindi?
You can fine tune for Hindi language. Refer the Discussion tab on their github.
When it was released everything was Linux specific. Thank's for making it work on Windows, I will try tomorrow.
With his instructions it does not work on Windows 11.
@@ArtificialDevLabs It is working fine. Just see the other comment you made, I followed the steps and it worked fine.
@@trishul1979 It does not work for me on Windows 11 The installation asked about a gated model gemini 2. and now when I followed every step correctly I am getting this error with a new latest Anaconda installation: "ImportError: DLL load failed while importing libtriton: A dynamic link library (DLL) initialization routine failed."
getting error " need conda init first before activating conda" when trying to activate conda omnigen
Don't use powershell, use command prompt.
I got errors when following your exact steps. Turned out I was missing the Visual Studio Build Tool (Make sure to select C++ for Desktop when installing them, otherwise it will fail!)
Where to put the model?
The models are automatically downloaded depending on which app you launch, you don't have to do anything. I have covered this in the video.
I get "you need to call conda init first" but if I do it give me another error.
For which step are you getting this error?
@@StableAIHub For `conda activate omnigen"
Большое спасибо за данное видео!
Glad you find it useful.
Mine said Error code:2 Runtimeerror:couldn't install torch it was so close to finishing only 3 gb left😭
Use Forge github.com/lllyasviel/stable-diffusion-webui-forge
Damn dude this looks awesome! I was just looking for a way to do upscaling on some older tv shows and do offline interpolation on some 4k content, as svp RIFE won't do 4k in realtime properly. I'll be trying this out this weekend hopefully, and I'll let you know how easy it was to use, and how much I enjoyed it. Thank you for making something like this <3!
I actually heard the conversation thrice, it doesn't look like TTS. How come there are expressions, laugh, etc..
Google notebook lm
Did you recorded the conversation for each speaker separately? This looks really good.
How did you generated the conversation used during installation guide. It is really amazing. Please make it little slow next time.
Yeah conversation is life-like. Doesn't look like recorded separately.
This is google notebook lm
@@KimiMorgam Please share the link
@@trishul1979 google for notebooklm google?
Awesome, thank you so much, this tutorial is so convenient and easy!
Hi, thanks for the tutorial, I don't have the webui_en file in the folder? where did you get it from?
It's in the video description. "Additional files"
@StableAIHub Thank you I manage to install everything with some GPT help. By the way, After generating the audios, which are great, i press download but it creates an empty file, OK .wav file. Any ideas?
@@content1 For functional issues please post here github.com/FunAudioLLM/CosyVoice/issues
Very good tool.
Awesome.
Working fine on 12 GB VRAM and very fast too. Appreciate the guide.
Working fine. Appreciate the easy tutorial.
Glad it helped
Thank you. Very easy guide.
what tool you used to lipsync generation
ruclips.net/video/iVy2bXPQNKY/видео.html
something better than liveportrait is coming. X-Portrait 2: ByteDance’s AI Lip-Sync Tool
I failed to install X-Portrait and looking at the comments the output was very bad. Let's hope v2 is better. Please let me know when it's released.
As your videos progress I can see how your AI mascot lady gets better and better with each revision. The lips are moving more naturally for this one. I'm sure the expressions will get more nuanced soon. So within 1 year you will automatically have a archive of footage of how the mascot went from primitive to fully fleshed AI human clone. Cheers. ps : Can you do a video on the recent Facebook sapiens ? b
I know. Ai is progressing too fast. EchoMimic is the best in terms of skin textures and neck movement. I figured changing some settings produce even better results, did a video for that. Let me check github.com/facebookresearch/sapiens
use pinokio
Does anybody faced the problem with "CUDA out of memory" effect and have you set this environment variable to ty to fix it - "PYTORCH_CUDA_ALLOC_CONF=max_split_size_mb:512" Does the model could be optimized somehow to work on 8GB VRAM?
The project is not updated for a long time. try OOTDiffusion, video is posted in this channel.
Thank you
Welcome
what image generation AI did you use at 0:43? It is really nice. The woman looks realistic
Don't remember, I'm using this image in all my videos. Might be chilloutmix in Auto1111.
How to install in python 3.9 ?
Have you tested it it is working with python 3.9? You can install miniconda and then Step 3: conda create -n ootdiffusion python==3.10 Step 4: conda activate ootdiffusion Rest everything remains same
Can you provide a video on how to run the inference with nemotron 70b unsloth 4bit on vram 8GB or on colab t4
What is Nemotron. I never heard about this. What is this used for?
@@StableAIHubnemotron is a massive large language model developed by nvidia and u can run it locally if u have enough ram
@@Gamatoto2038 I have not came across this so far as I am focusing on SD only. Will see if there is any case study I can do around this.