Text Preprocessing

Поделиться
HTML-код
  • Опубликовано: 31 дек 2024

Комментарии • 12

  • @khairulikhwanazman6094
    @khairulikhwanazman6094 2 месяца назад +1

    I need to save the clean and preprocessed word from orange in excel but whenever I save data it just revert back to the original data

  • @eylmaz6696
    @eylmaz6696 8 месяцев назад

    does orange have Cumulatif distribuation function and probability distribution function to get out the results ?

    • @OrangeDataMining
      @OrangeDataMining  8 месяцев назад

      No sure what you wish to achieve, but these options are available in the Distributions widget.

    • @eylmaz6696
      @eylmaz6696 8 месяцев назад

      @@OrangeDataMining for clustering on kmeans algorithm ? which one is important silhouıette skor or centering on the intersection is important by checkng scatter polot?

    • @eylmaz6696
      @eylmaz6696 8 месяцев назад

      @@OrangeDataMining for kmeans clustering ; how can i make a comment for result ? for instance, relation between ; I have anxeity, I dont have anxiety ; I sleep much ; I dont sleep much. When I cluster them, will i comment it by using the silhouette score maximum ?

    • @OrangeDataMining
      @OrangeDataMining  8 месяцев назад

      @@eylmaz6696 Apologies, I don't quite understand the question.

    • @eylmaz6696
      @eylmaz6696 8 месяцев назад

      @@OrangeDataMining do you have support mail or phone ? can i get one question

  • @neilirvine7129
    @neilirvine7129 8 месяцев назад

    Love it!

  • @gabrielapinto5306
    @gabrielapinto5306 7 месяцев назад

    I am finding it difficult to adapt all that to tweets written in Portuguese. Does orange have a solution?

    • @OrangeDataMining
      @OrangeDataMining  7 месяцев назад

      Yes. The tokenizer remains the same. Stopwords are available for Portuguese, too. Same for lemmatization (only UDPipe). SBERT and FastText also support pt. In summary, most language specific methods support Portuguese (some also pt-br), others are language independent.

  • @nadiamaelaniulfah1100
    @nadiamaelaniulfah1100 4 месяца назад

    does orange not support arabic? orange told "no text found" when i'm uploading my arabic corpus. any solution for this?🥲

    • @OrangeDataMining
      @OrangeDataMining  4 месяца назад

      Orange supports Arabic to some extent. There is Arabic lemmatizer with UDPipe, stopwords from NLTK, and embedders in Document Embedding. Your error tells you you are likely missing a text variable in your data. Please head to our discussions board (github.com/biolab/orange3-text/discussions) where we pinpoint your problem.