Can I Make A Search Engine From Scratch?

Поделиться
HTML-код
  • Опубликовано: 27 июн 2020
  • I set out to make my own search engine. Yes there are already options like Google, DuckDuckGo, and Bing. But creating my own helps me better understand how they work, and I can make it function however I would like. I don't know if I will ever host this for the public to use. For now, it's just a project I'm working on. I would love to be able to implement complex queries and word associations, and maybe someday include image search as well.
  • НаукаНаука

Комментарии • 151

  • @seraphimwiththecheese5880
    @seraphimwiththecheese5880 4 года назад +97

    Interesting video! I learned a lot about how search engines work. Keep it up!

    • @_equalo
      @_equalo  4 года назад +5

      Thanks Seraphim!

  • @Foxtrot445
    @Foxtrot445 3 года назад +26

    Him:
    Hey google, how do you sell a child?
    Google:
    We have a Wikpedia article for that!
    4:18

    • @Phiwipuss
      @Phiwipuss 3 года назад +2

      People also ask: How much is a kid worth on the black market?

    • @GamingFemboy
      @GamingFemboy 3 года назад +2

      @@Phiwipuss no just no

    • @Phiwipuss
      @Phiwipuss 3 года назад +1

      ​@@GamingFemboy I got it from the video. -_-

    • @user-oas
      @user-oas 3 года назад

      @@luke-bookbear ?

    • @silverhoney6969
      @silverhoney6969 6 месяцев назад

      Yeah that was a weird example for the creator to use!?

  • @burtmcgurt3584
    @burtmcgurt3584 Год назад +2

    Awesome start! I am excited to see where you go with this!

  • @busterdafydd3096
    @busterdafydd3096 3 года назад +14

    6:05 another fact about your search engine is that it will probably only target english pages and not the whole world's languages

  • @Holleylifestyle
    @Holleylifestyle 2 года назад +1

    This was amazing. Gotta get to work. Thx for the great content.

  • @mrbushy7262
    @mrbushy7262 3 года назад +9

    I watched the video and thought you had 456k subs because it was so good 😅 you earned a sub with notifications.

    • @_equalo
      @_equalo  3 года назад

      Thanks mrbushy!!

  • @SlappyRB
    @SlappyRB 3 года назад +2

    in the first 1 minute, i already am enjoying this video

  • @footiecyclo
    @footiecyclo 3 года назад +77

    Thanks Pewdiepie

    • @LegoCustomsAndMore
      @LegoCustomsAndMore Год назад

      Lol you are so funny i wish i was as funny as you🙄🙄🙄🙄🙄
      Ur like really not funny

  • @moondev369
    @moondev369 2 года назад +2

    I'm working on something similar in machine learning. I'll let you know when i am through! Glad to see you workin do hard on it. You can do it!!!

    • @waynefilkins8394
      @waynefilkins8394 2 года назад

      That's probably the only way to compete these days. It would take so much time and manpower to build something like Google, but if you can incorporate machine learning into it, might bypass a lot of the stuff they had to do the slow way

  • @StrangerHappened
    @StrangerHappened 3 года назад +12

    The lad is pretty adorable, I must say; an interesting content.

  • @aa-qz2ej
    @aa-qz2ej 3 года назад +1

    Just subscribed, great content!

  • @user-gu9hl5uk9f
    @user-gu9hl5uk9f 3 года назад +17

    Try word embedding. It basically changes strings to vectors and index said vectors. When you search you have to vectorize the search term and compare it with saved indexes. It should be faster than using strings to compare

    • @seankim8123
      @seankim8123 3 года назад

      Hi i have some questions

    • @user-xr3rb6pn9m
      @user-xr3rb6pn9m 3 года назад +4

      Better yet one can use proper database (like MySQL or SQL Server) to increase the search speed.
      If there's one table website URL and full text and other 2-3 tables with the most frequent words, bigrams, and trigrams per URL (ordered by frequency), that will be much more efficient than always relying on full-text seach, and it would also improve relevance.

  • @AriJankelowitz
    @AriJankelowitz 3 года назад +4

    Great video with excellent music choice!

  • @user-xr3rb6pn9m
    @user-xr3rb6pn9m 3 года назад +20

    Great project!
    Are you aware of nltk package? It is capable of removing stop-words, stemming, word and word collocation frequencies, and so on. I believe that could help greatly with indexing.

    • @_equalo
      @_equalo  3 года назад +3

      Good point! I’ve used NLTK a little, but wasn’t confident enough with it to apply it to this project. Maybe it’s time for me to dig into it

    • @Snakeythepuppet
      @Snakeythepuppet 9 месяцев назад

      @@_equalowhat happened to your channel?

  • @builder481
    @builder481 2 года назад +1

    Amazing video and also very funny lmao

  • @manishbhati2722
    @manishbhati2722 2 года назад +2

    I want to know further about indexing. If possible, make next part of this search engine video.

  • @alisiddiquii
    @alisiddiquii Год назад +1

    Thanks for this interesting video, I've learnt about how search engine works after passing my exam

  • @snehashisbera8316
    @snehashisbera8316 2 года назад

    thanks bro, I Learned a lot today by practical way.

  • @sirrealsam
    @sirrealsam 4 года назад +7

    00:29 had me crack up, haha :-D

  • @vikinggeorge7007
    @vikinggeorge7007 Год назад +1

    The second I saw python I stopped the video to rethink my life

  • @marvelousmarvelxx3889
    @marvelousmarvelxx3889 3 года назад +1

    Subscribed!

  • @somewherenear3003
    @somewherenear3003 3 года назад +1

    Oh hey! I decided to make my own search engine this year back in 2020 too. Now is the time I'll be doing this project. I'll be sharing my progress on my channel.

  • @TiagoTiagoT
    @TiagoTiagoT 3 года назад +2

    How about using GTP-style "tokens" to encode both page contents and search keywords?

  • @PaAGadirajuSanjayVarma
    @PaAGadirajuSanjayVarma 4 года назад +5

    Good work bro.try to use hashing of words in a web page and store them in a hash table.I think it might increase it.search hashes instead of word to word

    • @_equalo
      @_equalo  4 года назад +1

      That’s a smart option! I’ll try to do a follow up video trying that and a couple other methods

  • @bruinebeerinhetblauwehuis
    @bruinebeerinhetblauwehuis Год назад

    Very interesting project from Michael Falk!

  • @phillipspodcast
    @phillipspodcast 3 года назад

    Good video mate

  • @myztartupjourney6772
    @myztartupjourney6772 2 года назад +2

    Equalo you should make a part 2 to this video!!

  • @wickederebus
    @wickederebus 3 года назад +4

    so, im guessing this project did not get a follow up video?

  • @krissna9697
    @krissna9697 Год назад +1

    To create a search I'm a coumputer science student beginner .there are lot of fields in computer science major, in all of these which field do I specifically need to take to develope a search engine? please

  • @ryanmacalandag5279
    @ryanmacalandag5279 2 года назад +3

    I'm also interested. I'm looking into building a search engine for a group of less than 200 related websites only. This is an insightful video. Hopefully you you post the code. Thanks

    • @zanjeev8654
      @zanjeev8654 Год назад

      Did you create your own search engine?

  • @av3stube480
    @av3stube480 Год назад +2

    Was the number of links you parsed from the second batch of pages referring to the total links or the amount of unique links? It's pretty easy to think of an example of a set of Wikipedia pages that all link back to one article, or even two or more pages that all connect to each other and create endless loops.
    Of course, looking for changes in website contents is necessary, but avoiding crawling over the same pages too often should speed up the rate of expanding the database and reduce the strain on the hardware in the long term.

    • @dh2032
      @dh2032 Год назад

      you would have a process. search, in document, duplications, but you would still Identify if the are in deed duplications, and not just similar, and even if it was real duplications, it was still linking for a reason, unless it just the home button or something like that?

  • @annuritv4617
    @annuritv4617 3 года назад +6

    I'm ready to help.

  • @ToniMartiAlbons
    @ToniMartiAlbons 3 года назад

    Nice video 👌

  • @prithivirajr7918
    @prithivirajr7918 3 года назад +2

    WHATS THE LEVEL OF YOUR SERCHENGINE NOW

  • @sebastianjohannes2316
    @sebastianjohannes2316 2 года назад +1

    What coding tool do you use to create the search engine?

  • @FilippoBerardo
    @FilippoBerardo 7 месяцев назад

    question: The search of pages can be done even with ip addresses, cycling every possibile number? Or only with links? Using links means you must have a list.

  • @jasonfanclub4267
    @jasonfanclub4267 2 года назад

    Good content

  • @vihaankedia8134
    @vihaankedia8134 3 года назад +12

    could you send your source code for the search engine

    • @nyancat5140
      @nyancat5140 3 года назад +2

      Yeah, I was trying to do this by myself. A link to the code would be great!

  • @adamayala3906
    @adamayala3906 4 года назад +4

    Nice

    • @_equalo
      @_equalo  4 года назад

      Thanks Adam!

  • @Carambal81
    @Carambal81 4 года назад +12

    Your videos are very informative, I just learned how to sell my child! :P (@4:18)

    • @_equalo
      @_equalo  4 года назад +7

      Haha I’m always happy to help. Glad you found that part useful

    • @divyanshusah2809
      @divyanshusah2809 3 года назад +4

      @@_equalo 🤣🤣🤣

  • @JamesScottGuitar
    @JamesScottGuitar 3 года назад +3

    How’s the project going now?

  • @Zz-ol5bx
    @Zz-ol5bx 3 года назад

    But won't this method be too much time consuming?is there any other way to make it faster or automate it bro

  • @Plexversal
    @Plexversal 3 года назад +1

    9:10 lmfao that gif

  • @callofdutymobile1074
    @callofdutymobile1074 2 года назад

    More interested in the coding aspect you share source code or how to get started

  • @sparrowEP
    @sparrowEP Год назад

    2:57
    when u open OBS for the first time:

  • @ryxn3x
    @ryxn3x 4 года назад +5

    👍

  • @busterdafydd3096
    @busterdafydd3096 3 года назад

    5:10 it wouldn't be a bad idea to chuck the word recursion in there and state that that's what your crawling and parsing is doing

  • @AndrewsLorenzana
    @AndrewsLorenzana 3 года назад +1

    Hey, how is your search engine going?

  • @mathcloud
    @mathcloud 3 года назад +1

    So where is your search engine? No link?

  • @movocode
    @movocode 3 года назад +3

    4:19 Did anyone see what he typed in the Search !! 🤣🤣🤣

  • @sprinteroz2239
    @sprinteroz2239 2 года назад

    Is this on github or you keeping the code private?

  • @merrytantrimilleniatobing3740
    @merrytantrimilleniatobing3740 2 года назад

    Can you give us the tutorial to make that search engine, sir?

  • @allanbenedict4558
    @allanbenedict4558 3 года назад

    Like it

  • @NickKartha
    @NickKartha 2 года назад

    Hope the project progressed beyond python scripts.

  • @melindamassey14
    @melindamassey14 3 месяца назад

    Your choice of wiki topic to search????

  • @coyzee1
    @coyzee1 3 года назад

    LOL, 7.35 She gushed over him. Sorry about that. Very interesting, thanks for the vid.

  • @MoyaNandaoOfficial
    @MoyaNandaoOfficial 3 года назад

    Please upload for me the download link of your search engine. Thanks

  • @gspapp
    @gspapp 3 года назад +1

    link?

  • @subhakantasahoo9760
    @subhakantasahoo9760 3 года назад +2

    I searched for it after whatspp's new january privacy up-to-date 😃😃

  • @iamanishkumar
    @iamanishkumar 2 года назад

    Why aren't you making another video?

  • @mawrahassan1973
    @mawrahassan1973 2 года назад

    I think this is the guy who has a channel named "what I've learned"

  • @namithshetty
    @namithshetty 3 года назад

    Bro go on title and headline base because then only you can get the information

  • @busterdafydd3096
    @busterdafydd3096 3 года назад

    5:27 so now you start indexing. keywords in the websites links to websites

  • @RageBird7200
    @RageBird7200 11 месяцев назад

    What about Bing Users?

  • @LittleNinjaXYBA
    @LittleNinjaXYBA 11 месяцев назад

    I have duck duck go! 🥳

  • @universenerdd
    @universenerdd 4 года назад +2

    From scratch just bothers me for some reason

  • @dr.official9852
    @dr.official9852 3 года назад

    Wait a minute

  • @twansmith2622
    @twansmith2622 2 года назад

    Hello r u still working on this

  • @vhwjpzf1z0fi73a
    @vhwjpzf1z0fi73a 2 года назад

    Any news on this?

  • @menopriezvisko2232
    @menopriezvisko2232 Год назад

    try arango database for inverted index searching

  • @OfficalOxy
    @OfficalOxy Год назад

    duckduckgo is getting popular

  • @pavannaidu759
    @pavannaidu759 2 года назад

    can you share the code

  • @legixstudio6713
    @legixstudio6713 3 года назад

    7:45
    is'nt scott morison the priminister

  • @madabtMSK
    @madabtMSK 3 года назад +1

    good try!

  • @raymondchiwade
    @raymondchiwade 3 года назад

    How can I contact you

  • @RahulSharma-oj4ik
    @RahulSharma-oj4ik 3 года назад

    PewDiePie's Got A Search Engine😀

  • @RmbTfitness
    @RmbTfitness 3 года назад

    I laughed a lot... watching your video

  • @prudvi01
    @prudvi01 3 года назад

    duck gang

  • @Ab-cj6gl
    @Ab-cj6gl 2 года назад

    i was planning to build something similar but ain't gonna happen 😂

  • @apang1831
    @apang1831 Год назад

    "how to sell your child"

  • @saidkauzu7831
    @saidkauzu7831 3 года назад

    I have an idea do you want listen?

  • @Naderium
    @Naderium 3 года назад +2

    420th like

  • @ahsanabrar880
    @ahsanabrar880 2 года назад

    please share update.

  • @makersspace565
    @makersspace565 3 года назад +1

    petabytes=too expensive

  • @jessetate7601
    @jessetate7601 3 года назад

    Use folder and use letter to me the floder

  • @Captainsunnyxxx
    @Captainsunnyxxx 2 года назад

    😎My friends says -
    You can't compete...
    👩My mother says -
    Never give up...
    😇"I follow what my mother says"

  • @TF-cn6oj
    @TF-cn6oj Год назад

    Update please

  • @rylandsquires4886
    @rylandsquires4886 3 года назад +1

    Watch at 1.5x speed

  • @sparrowEP
    @sparrowEP Год назад

    "mom can we have google"
    "we have google at home"
    google at home

  • @SlumDawgSaint
    @SlumDawgSaint 2 года назад

    Did you make it, i want to make it and make it public and add free and zero tracking of people good gearh engine add free bias free...;)

  • @joshclaassen616
    @joshclaassen616 3 года назад

    Check out safenetwork.tech there are a few people pondering how to implement a search engine and you may enjoy the challenge.

  • @Randomynous01
    @Randomynous01 2 года назад

    Google no longer operates on the number of links that link back to it, but rather PRIORITIZATION

  • @marlonlopez4154
    @marlonlopez4154 2 года назад

    Can you email me really interested.

  • @ahrorbekabdullayev2193
    @ahrorbekabdullayev2193 3 года назад

    I'd prefer wikipedia api

  • @busterdafydd3096
    @busterdafydd3096 3 года назад

    You didn't even paritially cover indexing in this video

  • @elliotsearchengine8626
    @elliotsearchengine8626 3 года назад

    We did something different
    Check ✔️ Elliot Search

  • @twistah
    @twistah 3 года назад

    Am I the only one who uses duckduckgo

    • @techample
      @techample 2 года назад

      Yes.
      Can't imagine someone not using Google😃

  • @rxkshan
    @rxkshan 3 года назад

    chuper

  • @mathquik1872
    @mathquik1872 3 года назад +7

    Are you on onlyfans?

  • @chris_the_nerd09
    @chris_the_nerd09 6 месяцев назад

    Opera is better than Google