Fuzzy match / merging in Power BI Desktop (October 2018)

Поделиться
HTML-код
  • Опубликовано: 16 окт 2024
  • In this video, we look at the new fuzzy match / merging option within the October 2018 version of Power BI Desktop. This feature is incredibly powerful when it comes to data matching and can be adjusted to meet your needs.
    LET'S CONNECT!
    Guy in a Cube
    -- guyinacube.com
    -- / guyinacube
    -- / guyinacube
    -- Snapchat - guyinacube
    -- / guyinacube
    **Gear**
    Check out my Tools page - guyinacube.com...
    #powerbi #powerbidesktop #guyinacube

Комментарии • 63

  • @jimfollen1731
    @jimfollen1731 9 месяцев назад

    You guys rock, love your PBI videos. They have become my go-to when I need to learn new techniques or functionality. Thanks!

  • @marjoriejorgensen801
    @marjoriejorgensen801 2 года назад

    Love your videos. Your energy/personality and training style is easy to follow and VERY MUCH appreciated! Keep it up.

  • @krupeshdesai
    @krupeshdesai 5 лет назад

    Great Videos Guys. Thank you for helping me to get my Power BI certification done. I am writting this from New Zealand and the Desktop Image in the background is just 2 hour drive from my home. It's called "Wharariki beach"

  • @VenkateshThammisetty-k4w
    @VenkateshThammisetty-k4w Год назад

    It's a good explanation Patrick!

  • @kirstyclark9439
    @kirstyclark9439 6 лет назад +6

    It'd be great to know more about the % accuracy and how that works rather than doing a bucketload of trial and error.
    Thanks for the video, great as always.

    • @ukgav
      @ukgav 6 лет назад +3

      I found 0.95 works best. The default of blank, which is 0.8 is a bit too fuzzy and you get too many false positives. (The range is 0 - 1 where 1 is the same as a normal join)

  • @RH-nk7eo
    @RH-nk7eo 3 года назад +1

    Can the transformation table contain multiple entries? e.g. Jim -> James, Jim -> Jamie etc.

  • @Fiktage
    @Fiktage 6 лет назад

    Great features! Thanks Patrick for good & simple review as always! i was stuck with the same problem during comparing of two tables (nested join)... i used to make lower case first, than trim, clean... after join.

  • @diegolozano2397
    @diegolozano2397 2 года назад

    Sooo usefull video, thansk a lot . Hugs from colombia 🇨🇴

  • @OzduSoleilDATA
    @OzduSoleilDATA 5 лет назад

    Thanks for helping me understand that transformation table. The Microsoft hit/tip isn't at all helpful, but this video got me through.

  • @selfservicebi1318
    @selfservicebi1318 6 лет назад +1

    Great addition to Power BI Desktop. Looking forward to having it GA together with Composite Models and PDF parsing.

  • @FelixFrost
    @FelixFrost 4 года назад

    nice! loved this guy's energy. Keep it up team!

  • @asjones987
    @asjones987 6 лет назад +4

    So the transformation table has to have headers of to and from? what if there was a join going the other way also? To be safe do we need to have "William" and "Bill" on both sides in the table? Thus "William" & "Bill" AND "Bill" & "William" ?

    • @GuyInACube
      @GuyInACube  6 лет назад +1

      Correct. From and To. If you choose to use it.

  • @livio2963
    @livio2963 6 лет назад +2

    This is a super great addition, especially when working with data coming from those pesky .csv, .txt and excel files filled in by people who got no clue whatsoever :D)
    OUTSTANDING yooooo

    • @GuyInACube
      @GuyInACube  6 лет назад

      YOOOOO! :) Love it! Thanks for watching.

  • @daphnerosario
    @daphnerosario 3 года назад

    Great video! I'm learning so much; love your genuine enthusiasm. Is there a way to reach out to ask a question?

  • @orkhannazarov2781
    @orkhannazarov2781 2 года назад

    This is awesome. Thank you!

  • @dhawalpmehta
    @dhawalpmehta 6 лет назад

    This is super cool, i had to do separate Error reporting for the typo earlier to fix this kind of issues where i was using anti joins to report the errors. But now i don't need to worry. Thanks Pat Fuzzy Matching with Patrick for this demo :)

  • @busello
    @busello 4 года назад

    Great tools! I have trouble with slow refresh and query application. I have about 3000 records to match with other table of 7000 records, and it takes hours! Any suggestions from where to start?

  • @patrickstokes5502
    @patrickstokes5502 Год назад

    Can you merge fields across multiple data sources to create a master Dimension table?

  • @joaquinarroyo315
    @joaquinarroyo315 5 лет назад +1

    This is great. Thanks for another top video! I've been trying this tool quite in detail and still impressed on how the algorithm performs (I think that 0.90/0.95 is the way to go to do not get many false positives) I'm currently testing this tool with a PBI project which is connected to a big data set and looks like is taking ages to run...would love to hear some insights on bigger scenarios. Seems that you know the team behind this product..is there any way to reach/contact them? Thanks!

    • @GuyInACube
      @GuyInACube  5 лет назад

      Great to hear it has been working well for you. I would imagine that with larger datasets it would take longer. Doing lookups and comparison over a large amount of data could be both memory and processor intensive. I don't know of any trick to optimize that really. Maybe post it in community.powerbi.com? The engineering team hangs out there. along with a bunch of other folks that may have suggestions.

    • @joaquinarroyo315
      @joaquinarroyo315 5 лет назад +1

      @@GuyInACube Thanks for your quick response! Yup, I've posted in there early this morning. Let's see if somebody replies. Thanks

    • @GuyInACube
      @GuyInACube  5 лет назад

      @@joaquinarroyo315 Good deal.

  • @sofyan471
    @sofyan471 6 лет назад

    Hi,
    Thanks for great help. You video is actually helping me learn power bi. Have one limitation in power bi i couldn't get workaround for.
    How could i filter data in other sheets of my report based on a scatter plot in my landing page. There is report level slicer i read about but i need scatter plot to filter the data

  • @user-pi2nl6iu2n
    @user-pi2nl6iu2n Год назад

    Respect brother

  • @ridingdatatoGenAI
    @ridingdatatoGenAI 4 года назад

    Hey Patrick, great video, thank you for that, i need to show the result on the dashboard and provide the functionality of search to the user.
    Example : On the dashboard I need to have a search option " Search hear what you need"
    The fuzzy works in the background and find the approx. match and displays the results on the dashboard, like on the websites we do.....
    Please reply "Urgent" support....

  • @Michel-qk8hb
    @Michel-qk8hb 3 года назад

    Great video, thanks

  • @gilmijar
    @gilmijar 5 лет назад

    I wonder what algorithm is used to calculate similarity (or diffeence) between strings in Fuzzy Merge. Is it the good old Levenshtein, Longest Common Substring, or something else?

  • @igershe
    @igershe 5 лет назад

    Can you do fuzzy joins in Excel when joining tables in the data model? I want to join multiple tables for my Powerpivot but I need to use fuzzy matching because I can't match the data exactly. thanks

  • @panayiotisagapiou4095
    @panayiotisagapiou4095 4 года назад

    What if the name is a 3word name (e.g. Chris Down Michael & Michael Chris Down) in the 1st table and in the 2nd table the name i have only 1word names (e.g. Chris). How i am gonna merge this? There is match in both rows of table 1. Thanks!

  • @arjunkoli8457
    @arjunkoli8457 6 лет назад

    Can fuzzy map work with making relationship between data, other than merging

  • @paultoyle6886
    @paultoyle6886 5 лет назад

    can you make a video to compare a string with a numeric value using if statement or switch statement please

  • @yoshihirokawabataify
    @yoshihirokawabataify 6 лет назад

    Nice. 😁😁😁
    I hope the Ignore Wide , and Ignore Kana to Fuzzy Match as Japanese.
    These Ignore feature exist in SQL Server Collation feature.
    and posted as Power BI Ideas. what do you think?

  • @jeffreywong1725
    @jeffreywong1725 5 лет назад

    Hi, I have been looking for the option then preview. i just can't find the fuzzy merge option. i am using the june 2019 version. has that been removed ? Thanks man

  • @MielieBom
    @MielieBom 6 лет назад

    Cool shirt... Where can I get one?

  • @jpcanivel2037
    @jpcanivel2037 6 лет назад +1

    What we do before? Use Fuzzy Look-up in MS Excel. It is good to know that it is available in PBI.

    • @GuyInACube
      @GuyInACube  6 лет назад +1

      Agreed. very happy to see this land inside of Power BI!

  • @VikingGuard
    @VikingGuard 6 лет назад +1

    fuzzy match is just so fun now when @patrick has done his.

    • @GuyInACube
      @GuyInACube  6 лет назад

      haha thanks! And, thanks for watching! Glad you enjoyed it.

  • @dbszepesi
    @dbszepesi 6 лет назад +1

    Where does he get those wonderful toys.....er shirts? -Joker, probably

    • @GuyInACube
      @GuyInACube  6 лет назад

      That shirt was one given away at conferences and there was some variant of the public Microsoft eCommerce store (not the actual Microsoft Store) where you could purchase shirts, and that was one of them. www.microsoftmerchandise.com/Shop/#/ I don't see them listed any longer though :(

    • @dbszepesi
      @dbszepesi 6 лет назад

      boooooo

  • @elrevesyelderecho
    @elrevesyelderecho 6 лет назад +1

    Before: almost checking one by one...now PBI!

  • @eZeeeZe1
    @eZeeeZe1 6 лет назад

    Best resolution for dev in power bi?? 1080p? 2k? 4k? wide?

    • @GuyInACube
      @GuyInACube  6 лет назад

      I would say nothing less than 1080p. I personally have two 34" Ultra-wide monitors. Regardless of what you have, try to target 1080p. I know Power BI Desktop does some scaling, but know that most people probably have some variation of 1080p. So, don't develop with your screen resolution and think it will look great for everyone.

    • @eZeeeZe1
      @eZeeeZe1 6 лет назад

      @@GuyInACube thks!!! I dev in 1080p. But I am thinking to jump to 34'' ultra wide.

  • @aryansena7290
    @aryansena7290 4 года назад

    1.How can I load multiple files with multiple sheet,column order can be different in different pages.2.how to load multiple file type as well as from table data into a single power bi dataset.3.i have 4 power bi desktop file which has same kind of data for 4 different periods (import mode). How to load into single pbix file from all 4 pbix file.4.how to load data from multiple power bi datasets to single power bi desktop .5.i have one complecated question ,which is unanswered in power bi community.can you pls tell me your username in community that I can tag you there.

  • @ZEROONETRAINING
    @ZEROONETRAINING 6 лет назад

    you guys made all kinds of T-shirts with Power BI logo/sign

  • @nicolasdemichieli
    @nicolasdemichieli 6 лет назад

    Can I use the Fuzzy match, to match any work in a sentence, like: match the work “Corolla”, in a sentence , “xei corolla dsl 15”.

    • @GuyInACube
      @GuyInACube  6 лет назад

      You could set the Similarity Threshold lower to see if that works. By default it's set to .8, which requires a pretty high similarity between the two values. Lowering it could possibly increase the possibility of a match.

    • @nicolasdemichieli
      @nicolasdemichieli 6 лет назад +1

      Guy in a Cube BANNGGG!!!! .6 is the number!!! Tks!!

  • @dileepreddi
    @dileepreddi 6 лет назад +1

    Hey from where you are buying those t-shirts ?

    • @GuyInACube
      @GuyInACube  6 лет назад

      The one in the video was from the Microsoft store. But, i don't see it any longer :( other shirts are random purchases.

  • @sofyan471
    @sofyan471 6 лет назад

    Hi ,
    I tried something to get multiple select option in drill through. Although the output wasn't beautiful the underlying objective was achieve. Can you please have a look ? Email address ?

  • @luuminhvuong
    @luuminhvuong 4 года назад

    i gave up after going to the cinema and return....BI still runnning the query

  • @tourianlabs4904
    @tourianlabs4904 5 лет назад

    You are a crack man! Thanks!

  • @jamezday
    @jamezday 4 года назад

    Yoooooo!

  • @trobuk99
    @trobuk99 6 лет назад +1

    DQS in Power BI

    • @GuyInACube
      @GuyInACube  6 лет назад

      Not quite. but getting there.