What are Elasticsearch shards? Why do they matter? Elasticsearch cluster architecture explained.

Поделиться
HTML-код
  • Опубликовано: 11 сен 2024
  • Elasticsearch is a fantastic tool but it's easy to muddle through without knowing the fundamentals. It's only a matter of time before your cluster performance drops, errors start happening, and you wonder just what a shard actually is.
    I talk and wave my hands about, explaining how we could have - maybe - built - some of - Elasticsearch ourselves. At the end of the video you'll know what a shard is, why they're so important, and want to learn more to really improve the performance of your cluster.
    Watch on (and watch the rest of the free course linked below) to figure out how to answer that mysterious question: How many primary shards do I need for my index?
    My course - Fundamentals of Elasticsearch architecture and shards - is available for free here: school.georgeb...
    You can read more about Elasticsearch and The Elastic Stack in general on my blog: georgebridgema...

Комментарии • 66

  • @medovanx
    @medovanx 16 дней назад +1

    This is really one of the most useful videos that introduced ES to me.

  • @nch77884
    @nch77884 2 года назад +43

    Hands down the best explanation and introduction to Elasticsearch. Can't thank enough for making this video.

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  2 года назад

      Thank you for your lovely comment. I'm so pleased you enjoyed the video!

    • @zaeemahmedabbasi
      @zaeemahmedabbasi Год назад

      ​@@GeorgeBridgemanData⁷77777777777777777777777777777777777777777777777777777777777777777777😅😮

  • @sridharnuthi1
    @sridharnuthi1 Год назад +7

    This is a video that should go to a reference library about Elasticsearch. Thank you for putting such a good, clear and methodical overview of ES. Just brilliant!

  • @netlob
    @netlob 2 года назад +13

    Holy sh*t! This must've been the best tutorial I've ever seen on RUclips. High production, clear presentation and well thought through. +1 subscriber for sure!

  • @Guille495
    @Guille495 15 дней назад

    Awesome explanation, I love your narrative style, it really underlines the why and how of the current ecosystem!

  • @HaisumUsman
    @HaisumUsman Год назад +3

    Man! You are not from this planet! You deserve a thousand thumbs up.

  • @lawlade
    @lawlade Год назад +3

    Watched this 5 times, rewound several times and i understand it FULLY. Thanks so much for such clear explaining

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  Год назад

      You're welcome! I'm so pleased you understood everything. I hope it helps!

  • @harrisonleong4283
    @harrisonleong4283 2 года назад +3

    I really wish this was the 1st elasticsearch video that I had watched, so as to save me so much time watching other video which could not teach me the same level of information that I need. Thank you very much, and I shall check out your courses.

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  2 года назад

      That's the reaction I was hoping for - a useful first video on Elasticsearch. Thank you so much for posting!

  • @TheOtmane007
    @TheOtmane007 Год назад +2

    What a clear , and progressively explained architecture. Thank you so much

  • @crujzojam7004
    @crujzojam7004 2 года назад +3

    Please post more videos…ur videos are easy to understand and quite informative….please carry on the good work

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  2 года назад +1

      Done! New video just posted!
      There will be more. I have lots of ideas but I'm really trying to get this my Elasticsearch course finished, and work the day job.

  • @thomasanderson8478
    @thomasanderson8478 2 года назад +3

    This is the best explanation of elasticsearch I've ever seen. So many videos skip over the details, and it's been making it difficult to understand what elasticsearch is doing under the hood. I normally don't comment on videos, but this is too high quality not to. Please continue to put out content!

  • @cliffmathew
    @cliffmathew 9 дней назад

    Very clearly explained. Thanks

  • @blossomwithcurls
    @blossomwithcurls 8 месяцев назад

    I just started learning Elastic search and this is the best an clear information on Elastic search architecture. Thanks for sharing!

  • @Transactional
    @Transactional Год назад +1

    Thank you. It feels like my brain is getting clearer.

  • @jhoyl
    @jhoyl 2 года назад +1

    Thanks - the perfect introduction to Elasticsearch architecture.

  • @yazzy9975
    @yazzy9975 Год назад

    This video changed my life. No exaggeration.

  • @jupudivinod
    @jupudivinod 2 года назад +1

    This is fantastic! Bricks till walls in a nutshell! Thanks much for this great presentation.

  • @hieungo770
    @hieungo770 8 месяцев назад

    Its two year late but the lesson is extremely value

  • @pseudolimao
    @pseudolimao 4 месяца назад

    where was thsi video 1 month ago. you should be paid by these software companies... bless your heart

  • @thsu1
    @thsu1 2 года назад +1

    thanks for the clear and awesome explanation to Elasticsearch and Lucene. really appreciate this useful content

  • @tkousek1
    @tkousek1 2 года назад +1

    Thank you very much sir for this information. Awesome people like you are what's good about this world!!! Much appreciated!!!

  • @user-tm6cd8sg5l
    @user-tm6cd8sg5l 5 месяцев назад

    Many thanks for your work! It's awesome video!

  • @hnyc1986
    @hnyc1986 2 года назад +1

    Awesome explanation about Elasticsearch!!!

  • @sobhan285
    @sobhan285 2 года назад +1

    Wonderful. Looking forward to more courses from you.

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  2 года назад

      Thanks so much! Elasticsearch Engineer Essentials is in the works, and I'll be posting shorter content on here as well.

  • @arpit9163
    @arpit9163 2 года назад +1

    Thank You for making this fantastic video !

  • @samrybkin9184
    @samrybkin9184 2 месяца назад

    Hi thanks for a video.
    For example we have: "unassigned_shards" : 40,
    When we run:
    GET _cluster/allocation/explain?filter_path=index,node_allocation_decisions.node_name,node_allocation_decisions.deciders.*
    {
    "index": "elastalert_past",
    "shard": 0,
    "primary": false
    }
    We reiceve next answer:
    "explanation" : "a copy of this shard is already allocated to this node [[elastalert_past][0], node[JaLzrdasdajQ], [P], s[STARTED], a[id=OmY9kwpHTlybJfSrWvdsadada6g]]"
    We have only one node and what we can do in this situation ?
    Also we have "number_of_replicas" : "0", "auto_expand_replicas" : "false", what we can do in this situation ?
    GET /.kibana/_settings
    {
    ".kibana_2" : {
    "settings" : {
    "index" : {
    "number_of_shards" : "1",
    "auto_expand_replicas" : "false",
    "provided_name" : ".kibana_2",
    "creation_date" : "1601664093",
    "number_of_replicas" : "0",
    "uuid" : "WKdIpzLFSP-ydObLw",
    "version" : {
    "created" : "7090299"
    }
    }
    }
    }
    }

  • @danielsantiago11
    @danielsantiago11 Год назад +1

    Premium content, thank you!

  • @rakeshkush1234
    @rakeshkush1234 2 года назад +1

    wonderful technical story.

  • @andy_ltluan
    @andy_ltluan Год назад +1

    I think that shard in ES has the same concept with partition in Kafka when they have all partition replicas in different nodes

  • @moritzlgrs401
    @moritzlgrs401 2 года назад +1

    Absolutely fantastic!

  • @cicd
    @cicd 2 года назад +1

    Great content, thanks for sharing!

  • @ucthuannguyen6432
    @ucthuannguyen6432 2 года назад +1

    Wonderful. Thank you so much.

  • @riazbacchus3962
    @riazbacchus3962 11 месяцев назад

    this is great content. thank you.

  • @johnsonakanbi367
    @johnsonakanbi367 3 года назад

    Thanks so much for this great presentation.

  • @PhanTanThangTH
    @PhanTanThangTH 6 месяцев назад

    Thank you so much :)

  • @Milostrosic
    @Milostrosic Год назад +1

    Very clear explanation!

  • @slapcanister
    @slapcanister 2 года назад +1

    This is so good.

  • @DrewIsFail
    @DrewIsFail 2 года назад +1

    Is it fair to say you could build ES from dynamoDB? I'm trying to compare the two.
    I would love a video on the query language, does it have a mathematical basis like sql does to sets?
    It goes without saying, but I'm say it, thanks for making this clear, concise, focused high level content.

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  2 года назад

      Hi there. I'm really pleased you enjoyed the video.
      I'm not sure you could build an equivalent of Elasticsearch using DynamoDB. There's a *lot* more to Elasticsearch than I talked about in this video!
      There's more content coming, including an introduction to the query language. The in-depth content will be in a training course instead of RUclips, though. I've never considered if there's a mathematical basis to the query language. I doubt there is in terms of what Elasticsearch offers, but all Elasticsearch queries are converted to Lucene query language, which may be more thoroughly researched. Interesting question!

  • @yazzy9975
    @yazzy9975 Год назад

    If elasticsearch distributes the data between the shards of an index such that each lucene store roughly holds the same number of documents, when you run a search query, elasticsearch, despite the inter-node communication, only knows which shards hold that index and not which particular shard will have that document? So it has to run the query against all the shards and merge results, it cannot just search the one shard that contains that document? It does not know beforehand based on how documents are distributed among shards.

  • @carlosroberto366
    @carlosroberto366 2 года назад

    Isn't the cluster the server (i.e. AWS EC2 instance) itself? To my mind, a node is not a server because you can create several nodes in the same machine. I was expecting to see MyCluster1 and MyCluster2 each having a single node, hence, high availability via cross-cluster communication.
    11:11 node = server in his example
    15:05 node = process

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  2 года назад +1

      A node is an Elasticsearch process running on a host. You're right that you can run multiple nodes on the same host (even not containerised), but it's not recommended and it's widely accepted that you only run a single node on a host.
      If you did run two nodes on a single host, you could have either one or two clusters on that host. The node is configured with the cluster name it's expected to join, so you could configure each node with a different cluster name and have two clusters on that host!
      Cluster formation can get quite involved. There are configuration settings that need to be applied specifically at the formation stage. I can do a video on how that works at some stage.

  • @akshaychawla7413
    @akshaychawla7413 Год назад

    I am not able to enroll for your course, tried with 2 different emails, please have a look into this.

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  Год назад

      Sorry for the very late response. I've had feedback from a couple of people using Firefox, who worked around it by using a different browser. I'm not sure if that's your issue but thought I'd mention it. Let me know if you're still having issues and I'll try responding quicker this time!

  • @sv_n
    @sv_n 6 месяцев назад

    1000th like 😅