How can I split a large csv file (7GB) into smaller Csv files using Python| Stack overflow Question

Поделиться
HTML-код
  • Опубликовано: 25 окт 2024

Комментарии • 25

  • @jasonp6597
    @jasonp6597 2 года назад

    You are my hero. Whenever I have a question and need it answered with solid examples: your channel always appears with a video answering my question. A treasure trove for any data practitioner. Thanks.

  • @nearflatline
    @nearflatline Год назад

    Thank you so much this was exactly what I need for slicing up an overly large .csv file. I am glad to have come across your video.

  • @muhannadhammad30
    @muhannadhammad30 9 месяцев назад

    Very helpful. Thank you! 👍

  • @naduarte
    @naduarte 9 месяцев назад

    I tried it for a XLSB file of around ~4GB after converting it with excel to CSV UTF-8, but it doesn't work. Don't know if Excel messed things converting, but seems like it has to much information that Pandas couldn't open, even trying to read it as Bytes or with readlines, or other libraries.
    I tried all the possible pd.read_XXXX methods of Pandas after trying all kinds of formats and its specific engine as pandas argument. Tried XLSB, CSV, ODS, XLS, XLSX, etc and neither of them were able to be read by Pandas. Excel opened the file in around 5-10 minutes, and Pandas couldn't read it, even after 90 minutes (My laptop has i9 12gen + NVME1TB + 32Gb RAM).

  • @adrianxpandelescu
    @adrianxpandelescu 11 месяцев назад

    Excelent !!! Thank you!

  • @mohitbhagat55
    @mohitbhagat55 2 года назад

    Hey, Soumil thanks for this Awesome code.

  • @prateeksarangi9187
    @prateeksarangi9187 2 года назад

    Thanks !! Helps a lot from Zipfile module !!

  • @rishichaturvedi1805
    @rishichaturvedi1805 7 месяцев назад

    Will it work if the chunk remaining at last is less that the chunk size we defined?,And is the procedure same for xlsx file format?

  • @vedhaskharche538
    @vedhaskharche538 2 года назад

    Awesome code, Thanks soumil

  • @p.rakeshkumar6259
    @p.rakeshkumar6259 2 года назад +2

    where should I put my input file?

  • @mrunal2283
    @mrunal2283 Год назад

    my large csv file is in Portuguese language, and i need to get them in English. can you tell me how? i have tried with Translator from googletrans module, that lead to errors in my case that too after a large span of waiting time.

  • @zulfiqurhaider2356
    @zulfiqurhaider2356 2 года назад

    You are just splitting the first 100 data in one CSV. How can you shuffle and store 100 randomly selected data in one CSV?

  • @srikantaghosh2386
    @srikantaghosh2386 Год назад

    Can you create a video on scrapping a web page and store the data into S3 to use it in athena?

  • @neuromyoprosthetics1725
    @neuromyoprosthetics1725 2 года назад

    Where to keep sample file who's is to be splitted into many

  • @ravitalaviya1576
    @ravitalaviya1576 2 года назад

    Hiii sir
    I have 3GB size of json file how to convert in CSV

  • @Ritz9030
    @Ritz9030 Год назад

    what is this platform you are using?

  • @hugocounoy7900
    @hugocounoy7900 Год назад

    Thank you !

  • @georgesmith3022
    @georgesmith3022 2 года назад +1

    Just open the file and use a generator for reading line by line

  • @sonnguyenthanh6811
    @sonnguyenthanh6811 2 года назад

    How could I avoid `the number order` at first column in splitted file?

  • @ohp47
    @ohp47 6 месяцев назад

    Noice … I have a 600 gb file .. I will try this and post the time taken.

  • @Mahmoud-ys1kt
    @Mahmoud-ys1kt Год назад

    Great method

  • @waseemawashra3697
    @waseemawashra3697 2 года назад

    How we can do that on xlsx file !

  • @neuromyoprosthetics1725
    @neuromyoprosthetics1725 2 года назад

    When make videos, always remember that every one not geneious as yourself to understand so fast rapid , go slow dear