
- Videos: 79
- Views: 40,951
StratusGrid
Joined Apr 6, 2021
StratusGrid is a true strategic partner, not an IT vendor. We invest time and resources into improving security and providing operational excellence resulting in better outcomes for our clients in the form of decreased risk and increased throughput, consistency, resiliency, and scalability. At the same time, we will listen first, be responsive and agile, proactive with communication, and focus on the specific needs of our clients so that we can have successful client partnerships.
We help private equity firms and large enterprises with AWS cloud cost optimization 🚀
Configure Site-to-Site VPN Connectivity Between AWS and Microsoft Azure
In this video, we'll explore how you can configure a Site-to-Site (S2S) VPN connection between AWS and Microsoft Azure, using managed VPN Gateways from both services. We'll also set up virtual machines in both services, and ensure we are able to successfully route network traffic between virtual networks in both cloud vendors.
Learn more at stratusgrid.com
Views: 57
Videos
Publish Linux Desktop Apps with Amazon AppStream Fleets (AWS)
68 views • 1 day ago
In this video, we'll explore the Amazon AppStream service and how you can provision Linux desktop applications to end users. We'll start by building a custom Rocky Linux image, with a couple of applications installed, and capture that as an image snapshot. Then we'll create an AppStream Fleet, Stack, and associate it with a user, and test running these applications through the AppStream interfa...
Cost Optimize Amazon EBS Snapshots | FinOps
40 views • 1 month ago
Amazon EBS snapshots can easily be overlooked during cost optimization efforts. In this video, we will explore some mechanisms you can use to optimize the costs of EBS volume snapshots. For example, you can manually delete snapshots after a period of time, move snapshots to archive storage, automatically retain snapshots with lifecycle rules, or use the AWS Recycle Bin to terminate snapshots af...
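As a rough illustration of the age-based options mentioned in this description (keep, archive, delete), here is a minimal Python sketch. The thresholds, the `Snapshot` class, and `classify` are hypothetical names chosen for this example and do not come from the video; a real script would feed in snapshot data from the EC2 API and act on the result.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

# Hypothetical retention thresholds; pick values that match your own policy.
ARCHIVE_AFTER = timedelta(days=90)
DELETE_AFTER = timedelta(days=365)


@dataclass
class Snapshot:
    """Illustrative stand-in for the snapshot fields this sketch needs."""
    snapshot_id: str
    start_time: datetime


def classify(snapshot: Snapshot, now: datetime) -> str:
    """Return 'delete', 'archive', or 'keep' based only on snapshot age."""
    age = now - snapshot.start_time
    if age >= DELETE_AFTER:
        return "delete"
    if age >= ARCHIVE_AFTER:
        return "archive"
    return "keep"


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    example = Snapshot("snap-example", now - timedelta(days=120))
    print(example.snapshot_id, classify(example, now))
```

Keeping the decision separate from the API calls makes the policy easy to review before anything is actually deleted.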
Configure Allowed AMIs Policy / Filter in Amazon EC2
55 views • 1 month ago
Amazon EC2 provides Amazon Machine Images (AMI) as a mechanism to capture an image of your EC2 instances and deploy new instances from. While many pre-built AMIs are provided for convenience, from AWS, AWS Marketplace, and publicly shared images from other AWS users, using third-party images can introduce security risks. The new "Allowed AMIs" feature in Amazon EC2 enables you to craft a specia...
Auto Stop & Start Amazon EC2 Instances for FinOps Cost Optimization
124 views • 2 months ago
There are many different methods of cost optimizing your AWS cloud spend. One of those methods is to automatically stop and start your Amazon EC2 instances when they're not needed. This is especially helpful for development VMs that are only actively being utilized during working hours. When development VMs are idle, they cost you money, but aren't actually providing any business value. We can ...
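The description is cut off before it names the mechanism used in the video, so the following is not that method. It is only a minimal Python sketch of the scheduling decision itself: given the current time and an instance's state, should it be started, stopped, or left alone? The working-hours window and the function names are assumptions made for this example.

```python
from datetime import datetime, time

# Hypothetical working-hours window; adjust to your team's schedule.
WORK_START = time(8, 0)
WORK_END = time(18, 0)
WORK_DAYS = {0, 1, 2, 3, 4}  # Monday=0 ... Friday=4


def should_be_running(now: datetime) -> bool:
    """Return True if a development instance should be running at `now`."""
    return now.weekday() in WORK_DAYS and WORK_START <= now.time() < WORK_END


def desired_action(current_state: str, now: datetime) -> str:
    """Map the instance state and the clock to 'start', 'stop', or 'none'."""
    want_running = should_be_running(now)
    if want_running and current_state == "stopped":
        return "start"
    if not want_running and current_state == "running":
        return "stop"
    return "none"


if __name__ == "__main__":
    print(desired_action("running", datetime.now()))
```

A scheduled job could call `desired_action` for each development instance and then issue the corresponding EC2 start or stop request.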
Cost Optimize Amazon EBS Block Storage Volumes
79 views • 2 months ago
Amazon Elastic Block Store (EBS) volumes provide flexible block storage in the AWS cloud platform. These volumes are typically attached to Amazon EC2 instances so that applications can store and retrieve data from them. When running a FinOps program to evaluate the cost and efficiency of cloud resources, it's important to understand how EBS pricing works, and how to identify EBS volumes that ar...
Improve Upload Speeds with Amazon S3 Transfer Acceleration
256 views • 3 months ago
If you need to push large amounts of data up to Amazon S3 object storage, the default S3 bucket endpoint might not provide enough throughput. To solve for this, Amazon S3 buckets support a feature called Transfer Acceleration. The Transfer Acceleration feature leverages the Amazon CloudFront Content Delivery Network (CDN) architecture to enable faster upload performance from your applications t...
Deploy Owncast RTMP Streaming Server on Amazon EC2 Ubuntu Linux Instance
373 views • 4 months ago
In this video, we will explore how you can set up your own private streaming server. To accomplish this, we will self-host the Owncast open source RTMP streaming server software on an Amazon EC2 instance running Ubuntu Linux Server. We will use Docker Compose to define the container, ports, and volumes, and then run the service. We will also create a DNS record to point to the server and use Cad...
Deploy Ollama and OpenWebUI on Amazon EC2 GPU Instances
1.7K views • 4 months ago
If you're interested in using Large Language Models (LLM) to generate text data, based on input prompts, but don't want to use managed services like OpenAI ChatGPT or Anthropic's Claude models, you can use open source technologies instead! In this video, we'll explore how you can set up an Amazon EC2 instance, with an NVIDIA GPU, install all the necessary dependencies, and deploy two open sourc...
Using the Conditional Node in Amazon Bedrock Prompt Flows
841 views • 4 months ago
In this video, we'll explore how to use the conditional node in Amazon Bedrock Prompt Flows! Learn more at stratusgrid.com
Integrate Amazon Bedrock Prompt Flows with AWS Lambda Serverless Functions
1.8K views • 4 months ago
Amazon Bedrock offers a new Prompt Flows feature, which exposes a graphical, node-based workflow editor embedded inside the AWS Management Console. Using Prompt Flows, you can integrate Large Language Models (LLM) with other AWS services, such as Amazon Lex, AWS Lambda, Amazon S3 storage, and Bedrock Knowledge Bases. In this video, Trevor Sullivan (Solutions Architect, StratusGrid) demonstrates...
Intro to Amazon Bedrock Prompt Flow Builder | Visual Workflow Editor
2.3K views • 4 months ago
Amazon Bedrock introduced a new feature in July 2024 called Prompt Flow. This is a graphical interface that enables you to build complex workflows utilizing large language models (LLM). In this video, Trevor Sullivan demonstrates how to utilize this visual workflow editor, in the AWS Management Console. Learn more at stratusgrid.com
Intro to Amazon S3 Express One Zone Directory Buckets and AWS SDK for Rust
282 views • 4 months ago
In late 2023, AWS announced a new feature for Amazon S3, called Express One Zone. As part of this feature launch, there is a new type of S3 bucket available, known as Directory Buckets. Unlike Standard S3 buckets, the new Directory Bucket type exists in a specific Availability Zone, instead of replicating data across an entire AWS region. Express One Zone storage is designed for workloads that ...
Build an Interactive Amazon SQS CLI Client with Rust
188 views • 5 months ago
If you're a developer using Amazon SQS, and need to test out various payloads, you might want to write a custom, interactive CLI tool that simplifies queue access. Rust offers a crate called "inquire" which can prompt users to select items from a list, type some arbitrary text as input, or even pick a date from a terminal-based calendar. In this video, Trevor Sullivan (Solutions Architect, Stra...
Intro to AWS Storage Gateway | File Gateway
1.1K views • 5 months ago
AWS Storage Gateway, File Gateway is a cloud service that enables you to store and retrieve file data via standard Network File System (NFS) or Server Message Block (SMB) protocols. You can deploy applications in the same network as the File Gateway server, mount an NFS or SMB file share, and utilize it as network storage. While the gateway provides the local network storage interfaces, the fil...
Learn Amazon SQS Queue Attributes and APIs with Rust
114 views • 6 months ago
Understanding Amazon EventBridge Event Bus Architecture
469 views • 7 months ago
Develop Rust Applications using Amazon SQS
280 views • 7 months ago
Configure Amazon Bedrock Knowledge Bases with Pinecone Vector Database
3.8K views • 7 months ago
Configure Retrieval Augmented Generation (RAG) with Amazon Bedrock Knowledge Bases and MongoDB
1.8K views • 7 months ago
Building Custom Amazon Bedrock Agents for Automation from Scratch
3K views • 7 months ago
Provision an IPv6-only Amazon EC2 Application Load Balancer
319 views • 7 months ago
Overview of AWS Management and Governance Services
156 views • 7 months ago
Deploy Lowest Cost Amazon RDS Instance for Dev-Test Scenarios
263 views • 7 months ago
Intro to Amazon Bedrock Discussion with Trevor Sullivan and Matt Barlow
61 views • 7 months ago
Amazon S3 No Longer Charges for Certain HTTP Status Codes
70 views • 7 months ago
VMware and AWS part ways: What’s next?
865 views • 7 months ago
Discussion: Migrating from VMware Cloud on AWS to Cloud Native
148 views • 8 months ago
Architecting Cloud Solutions with AWS Local Zones
130 views • 8 months ago
Hands-on With AWS Resource Access Manager (RAM)
634 views • 8 months ago
That's really high quality content! Tks!!!
enjoyed watching this tutorial. learned new features. two questions: what tool did you use to create an architecture diagram and how do you choose the topics to cover in your videos?
Thanks so much! I am glad you learned something new from this, as that's our goal! To answer your questions: 1) we used Lucid Chart to create the diagram for this video, and 2) that's a complex answer! Briefly, I look for opportunities / gaps in documentation that would benefit from having additional clarification. I produce high quality content to help fill in the gaps and add more clarity to those lesser-documented topics. I also do this on my personal channel. 🙂 Does that make sense?
@TrevorSullivan I assumed you were showing solutions to problems you've faced in your daily job, but your explanation totally makes sense. I'm following both of your channels, since the content you're creating lines up well with the tasks I'm doing at my job. Keep pushing! Looking forward to watching the next video!
Now show us the cost in AWS after 24 hours of use, lol
This is awesome video, thanks for your effort , looking forward
Thanks so much for your kind comment! I am glad you benefitted from this video.
Thanks a lot for this amazing video, man. I wasted a whole day with Claude Sonnet and GPT o1 and 4 and couldn't get this done, and your video finally helped me accomplish the task. One question: I got the same EC2 instance as in the video, but I wanted to know, if I shut down the instance every night, will the containers and models still persist on restart? Or do I have to mount a separate EBS volume and save them there? This is what the AIs were tripping me up on.
Thanks, Chris! I'm glad this video helped you out. You can shutdown the EC2 instance and the containers should automatically come back up when you restart it. Docker persists data on the root filesystem, and the Docker Compose file we created has the restart=always option set, so the containers will automatically come back up. Everything should just pick back up where you left off.
Thank you Trevor.
Hello sir, I need your help on this topic. I have an Elastic Beanstalk environment and I want the EC2 instances to have only IPv6 addresses, because AWS now charges for IPv4 addresses, but I am stuck on this. Please help.
Could you please tell me what web server the web application is running on? Is it Nginx or Apache?
Great video! Do you have or know of any guides on how to do this in cdk? Thanks for the great content!
Welcome back! :)
Best video for the topic on the internet. Thank you so much!
Hi, thanks for your very clear video. I am worried about the EC2 m5.xlarge price for a demo test. Any recommendation? Thanks
Hello Felix, you can spin up m5.xlarge using EC2 Spot Instances and it costs much less. Also, make sure you shutdown or terminate the EC2 instance right away when you're done using it.
@TrevorSullivan txs for your answer
Hey Trevor, great tutorial, I've learnt a bunch of new stuff and was able to run my Ollama server on GPU, however I've noticed that when I ask purely textual requests Ollama runs 100% on GPU, while if I ask llama3.2-vision to describe an image, it does it 49%CPU/51%GPU, and as such it's really slow. Do you happen to know how we can force llama3.2-vision to use 100% GPU? Thanks!
Hey there, thanks for your feedback on the video. I don't have enough experience with Ollama to answer that question. Have you tried starting a discussion on the Ollama GitHub repository? Maybe someone else has already noticed this behavior.
so helpful. wanna try this. hopefully ec2 free trial can be used to test this
Great explanation and well presented! Thanks for leaving in the parts where you struggle to get it to write something executable. It’s important for us to understand the reality of using these tools and how much actual time will be spent tweaking the prompts. Also gives a realistic sense of the reliability of LLM output.
Could you make another video based on the current state of offerings available for prompt flow?
Thank you so much Sir!! Your tutorials are great!! Newly, thank you!! Subscribed
Why can't a local folder be used?
What an amazing tutorial. Thank you man!!! Super helpful
I am getting an error when trying to use Claude as a model to test the Knowledge Base saying: "Your request rate is too high. Reduce the frequency of requests." Any idea what this might be due to?
Hmmm that's odd. Have you tried any other models?
@TrevorSullivan It seems it's been an issue as of late. I did try Titan and that did work - still no luck with Anthropic though.
@mirabdullah138 that's weird. I usually use Claude Sonnet. Maybe put in a support ticket and see if they can help? It doesn't make sense that you would get that error on a service that you pay for based on actual usage.
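For readers who hit the same throttling message from their own code, one common client-side mitigation is to retry with exponential backoff and jitter. The sketch below is generic Python and is not specific to Bedrock; `ThrottledError` is a hypothetical placeholder for whatever throttling exception your client raises, and retries will not help if the underlying account quota is simply too low.

```python
import random
import time


class ThrottledError(Exception):
    """Hypothetical stand-in for a client's throttling exception."""


def call_with_backoff(fn, max_attempts=5, base_delay=1.0, max_delay=30.0,
                      sleep=time.sleep):
    """Call fn(), retrying on ThrottledError with capped exponential backoff."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except ThrottledError:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            # Full jitter: wait a random time up to the capped exponential delay.
            delay = random.uniform(0, min(max_delay, base_delay * 2 ** attempt))
            sleep(delay)
```

The `sleep` parameter exists only so the wait can be replaced in tests; in normal use the default `time.sleep` applies.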
This is the kind of tutorial I wish I had back in the day when I learned AWS! As always, a really informative video!
Guys at 11:47 When it came time to install nvidia-utils, I had problems with the library and driver version (nvidia-utils-550) [message: Failed to initialize NVML: Driver/library version mismatch NVML library version: 550.107]. The solution was to check the package names and I saw that "nvidia-utils-550-server" existed and that's the one I installed and it worked. I assume that the distributions have been updated and now it has to match the driver that was installed (nvidia-driver-550-server).
Thanks for sharing your solution! I think I've run into this before, where the driver and utility versions mismatched.
great tutorial, excellent explanations! thank you sir!
Ran into this guy when i was teaching myself rust a while back, was super happy for him to pop up here as well. Content is super high quality.
Thanks for sharing your kind comment! BTW I am also a motorcycle rider, but I haven't been riding the last couple years due to health problems. Enjoy your bike! 🙂
Amazon needs to improve their setup instructions. So much time lost until finding your video. Thank you.
Amazing Explanations Thanks!
Awesome. Very thorough and impressive. Thank you.
Great content, thanks! Couple questions: 1. What do you use to track your clipboard history? That was neat 2. Say you generate a flow you really like. How do you productionize it afterwards? Particularly, what is supposed to be the usable artifact of building one of these flows? Is it to build out a library of prompts, and this flow was used just to run the process end to end?
Thank you! Here are answers to your questions. 1. I use the built-in Windows 11 clipboard manager. You can use WIN + V to activate it. 2. I'm guessing they intend to release APIs to access your Prompt Flows, although it seems they're completely missing during the preview. 🤷🏻
In my Rust on AWS journey, it turned out that the Rust SDK indeed requires two hops to fetch credentials, which is not the case for boto3, which worked out of the box in the same container where I tried to run the Rust app. Now revisiting this video for a deep dive on the topic.
How can I invoke the agent on a schedule, say every 10 minutes? Do I have to declare a Lambda function for that too?
can you share all your install commands and docker compose yaml file?
Informative
Hi sir, please help me. How can I check in the AWS Console which version of the instance metadata service my EC2 instance is using? I am looking for confirmation that it shows IMDSv2. Please let me know where to check in the AWS Console.
Hello, you can see this setting in the instance details, around the 15:30 mark in the video.
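The same setting can also be read programmatically. The sketch below assumes an instance dictionary shaped like one entry of the EC2 DescribeInstances response, where `MetadataOptions.HttpTokens` is `required` when only IMDSv2 is accepted; the function name `imds_mode` and the returned labels are made up for this example.

```python
def imds_mode(instance: dict) -> str:
    """Summarize the metadata service mode from an instance description."""
    options = instance.get("MetadataOptions", {})
    if options.get("HttpEndpoint") == "disabled":
        return "disabled"
    if options.get("HttpTokens") == "required":
        return "IMDSv2 only"
    # HttpTokens == "optional" means both IMDSv1 and IMDSv2 requests work.
    return "IMDSv1 and IMDSv2"


if __name__ == "__main__":
    # Hand-written sample data, not output captured from a real account.
    sample = {"MetadataOptions": {"HttpEndpoint": "enabled", "HttpTokens": "required"}}
    print(imds_mode(sample))
```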
Thanks, very clear. I wish it had an option to build and deploy from local machine.
Nice walkthrough; question: is it a requirement to use IPv6? Can we use v4 instead?
Yes of course you can use IPv4! As I mentioned in the video, I just use IPv6 because my ISP (Starlink) supports it, and I try to encourage its usage broadly. Thanks for your kind comment.
Thank you for the response!
Can’t we change the output of the prompts so that we can supply that output to the iterator? Currently it’s not allowing it to pipe with iterator as iterator is expecting array as the input type
I haven't been able to figure out how to use the iterator yet. Last I checked, the documentation was lacking any examples of how to use it. This feature is still in preview, unfortunately.
Very great tutorial, was looking for the exact same thing (y)
Can't we use the hosted OpenWebUI web interface to download additional or remove existing model instead of exec'ing into the docker container ?
Hello Prashant, yes I believe that's possible! I am pretty new to OpenWebUI, and am more familiar with using Docker commands, so that's why I went this route in the video. Great point!
@TrevorSullivan that's a really useful tutorial; you're probably the first (and so far the only one) to make a tutorial on this. Thanks a lot! I'll be hosting my own LLM soon, but I do have one quick question. I was reading through some AWS posts and found some details about the Inf1 and Trn1 (Inferentia and Trainium) instance types that AWS recommends, and wanted to know which you think would be the best option for the instance type: Inf1 or the G type? Would love to hear your thoughts on this 🙂
You definitely can. I have been doing that a lot.
@AndrewCLatham thank you 👍🏻
Why not use Inf1.xlarge for the EC2 instance type instead?
How much does this external private CA bundle cost?
The private CA certificate doesn't cost you anything. You just generate it with the open source "step" CLI tool. Is that what you're asking?
What is the cost for an acceptable spec?
This is what I want to know, is this practical?
This was really helpful. Very clear. No rushing through. Can you make a video on how to register both on-prem database like Postgres and RDS database to AWS Datazone?
Hello Manish, thanks so much for your kind comment, and suggestion for another topic! I love the idea to create a video on how to register a relational database with AWS DataZone. I haven't explored that service very much yet, but it would be a great learning opportunity, as always! 😊
Spot on. Thanks! 🙂
Hello Olav, thanks for checking out our videos! I'm glad this was useful for you. What other topics would you be interested in seeing?
Brilliant!
Thanks so much, Andrew! I'm glad this was helpful. 🙏🏻
Doesn't this have any concept of memory? Suppose I say my name is Shubham, and in the next query I ask it what my name is.
Nice! Thanks for your good initiative.
I think you could set "OLLAMA_BASE_URLS=ollama:11434" for the WebUI (instead of using the IP)
Your conversation flow is smooth as Prompt flow :). Good job mate. Subscribed
The previous video for me was like clicky-clicky in the console, but this one definitely makes more sense, since with the Lambda integration (as you mentioned) the use cases are limited only by imagination. I am not yet that familiar with the Bedrock API, and I ask myself how I can integrate this feature in a real project, since neither I nor the users of my app will log in to AWS to play with the workflow. It would be great if you could also mention this in future videos. Thanks for the great content! Keep going!
Interesting one. I am curious how complicated the flow will be in real life scenarios and if it’s manageable with all those prompt and debug units
Yeah, I think basic workflows will be easier. It's nice to be able to "split" your LLM logic into multiple components though, rather than having to orchestrate these kinds of workflows with your own custom logic. I see some potential here, if they invest in improving its capabilities. They will also need to provide a mechanism to export / import workflows, so they can easily be shared. Maybe I missed this option, or they'll add it later?