Hi Dr. Laurence... How may i remove a word from the word list? Can I do that? I mean, I want to generate a list removing some words. I believe that was posible in previous versions.
Hello Dr. Anthony, I've been trying to follow an older video of yours on how to work with lemmata, but the user interface changed since then, and I can no longer upload a lemma list as shown in the video. I can see in this video that there is still a greyed out "lemma list" functionality (even though i cant see it on my device). Where can i find the option to upload lemma lists and build word lists based on them on this Version on AntConc? Thanks a lot!
Hello. May I ask, what is the thing you type to search for the wildcard? Is it w^ or w' or w"? I tried several times with my Antcon but I didn't get any results. Looking forward to your answers. Thank you.
Dear Professor Anthony, I would like to compare a corpus to another corpus with the help of your software. I was wondering what exactly the numbers in Normfrequency relate to and how I could use this section to compare corpora, since you mentioned that Normfrequency would be practical to compare corpora of various sizes. Thank you so much in advance. Best Regards
Hi! All the normed frequencies are comparable across corpora. A normed frequency is the frequency of the target item divided by the total number of tokens in the corpus as a whole, multiplied by a scaling factor (e.g. 1000) so that that we get a "freq. per 1000 words" or "freq. per million words" value which is comparable across corpora. I hope that helps.
Hello, your tutorial is very helpful! I was wondering if there is a possibility to exclude some words, e.g. articles, filler words and other grammatical words that by necessity will appear many times. Thank you very much if you answer me!
Hi Anthony. I'm trying to hide stop words in my corpus in the wordlist tool but I'm having a problem with it. These are the steps that I did (perhaps you can let me know what I did wrong): Upload corpus > Global settings > Tool filters > Add stop words file (in txt format) > tick the cluster/n-gram etc checkbox > tick the hide words in file checkbox > click apply > run my wordlist (stop words are still there) Thank you in advance for your help! :)
Hi, I just tested the latest version of AntConc here and it works fine. I suggest you start by just using one of the in-built corpora (e.g. The AmE06 learned corpus) and then load in a stop list with a single word inside (e.g. "the"). Make sure you can get this working and then try the stop list on your own files. Let me know what happens.
Professor Anthony, thanks for sharing such a useful video! Can you please show how lemma file is applied? I read the tutorial and watched video, but still find it hard to use it. What I expect to see is, for example, 'look (30) - looks (2) - looked (3) - looking (10)', like this. But What i can see now is totally different from that.
Hi. Rather than using a lemma file, I would recommend that you simply POS tag your raw files with a POS tagger like TagAnt. If you really want to use a lemma file, you need to apply it when you build your corpus, via the Corpus Manager. See the help guide for instructions.
Yes. You can apply a lemma list when you build your corpus in the corpus manager. The stop list is in the global settings (under filter) and applies across multiple tools.
Hello, I want to ask a question- in this version there is a possibility of "stoplist word" like in previous version (e.g. 3.5.8)? In the section "advances search" I can found the option "Search Query List", but I'm looking for the opposite function. Thank for your job!
Hi Monika. I think what you are looking for is the filter option that you can find in the global settings. You can now apply a stop list across multiple tools, which is why I moved the setting to the global settings.
Hi Bruna. Yes, you can do this. I've moved the function to the global settings as you can now apply a stop list in multiple tools. Check the "Tool Filters" option, where you can set a list of words to 'see' or 'hide' in the results.
Hello Dr. Anthony! I am currently trying to create a corpus about the language used in marine biology and I try to find the most frequent discipline-specific words using the Word list tool. Is there a way to eliminate function words from my search? Thank you very much in advance!
Hi! The easiest way is to load a list of function words in the global settings under tool filters and then choose to hide them in the results. A more advanced way is to compare your corpus against a general corpus and generate keywords. You could try using the 1 million word AmE06 or BE06 from the corpus manager as the reference. Watch the keyword tool video, which explains this.
Hi Laurence. I wanted to ask you something about the "Headword/Grouping list" option in the corpus manager. How is the format for the list supposed to look like? Also, how do I form queries using the headword/grouping terms (I know it's possible, but I don't know how to do it)? Sorry if this information is already in the manual, but I haven't been able to find it. Thank you.
Hi Mark, These are great questions. I checked the help page and it doesn't explain what the format should be! How silly! The format is a simple TSV file of the headword and the family members, all separated by tab spaces. cat cats cat dog dogs dog To search for headwords, the simplest way is with as follows: *_*_cat (to search for all words with "cat" as the headword. The whole topic of POS tagging and LEMMA searching in AntConc 4 is currently a little under documented. I'll try to address this as soon as possible. By the way, I recommend you join the AntConc discussion group, where discussion can take place a little easier.
Hi. Sorry for the slow response. A few others have reported on this, too. If you use the Windows version in Wine, you should find everything works fine. I'm now looking to create a Flatpak version, which should not have the same issues across different Linux distros.
Dear Professor Anthony, Thank you so much for all your programs and your tutorials. I am an International law Phd candidate and I have been using Antcon for a few weeks. Lingustic not being my specialty and not being very confortable with programming I have some difficulties but I generally always find an answer in your videos or website. I have seen in another video that you can research words from a word list. I made one with a total of 282 verbs (docx and rtf format) and I would like to upload it to confront it with my corpus (tagged and untagged). I cannot seem to find the way to do it in Antcon 4.1.4, could you please help me ? Thank you so much in advance, Respectfully
Hi Laurence, I am Englist philology student. I need your assistance! Please explain how to get top 10 lists of nouns, adjectives, verbs. Thank you in advance.
Hi, This sounds a bit like a homework project. But, anyway, here are some hints. 1. POS tag the text data (e.g. using my TagAnt tool) 2. Load the POS tagged data in AntConc using the "simple_word_tag_headword" indexer. 3. Search for the word types preceded with a wildcard (e.g. *_NOUNTAG) I hope that helps! Laurence.
@@AntLabJPN Dear Laurence, yes, I have research paper. Thank you for advise. Could you please explain what is POS? If I correctly understood I should upload TagAnt for part of speech importing. Do you have youtube link with TagAnt? Thank you for your assistance, because you are my last chance.
@@AntLabJPN Dear Laurence, I uploaded plain text with POS into AntConc4.2.0, however AntConc counts a noun as a word, and it has a high frequency. Inserted query *_NOUN doesn't work. I used word+pos, because TagAnt 2.0.5 doesn't have "simple_word_tag_headword". Could you advise where is my mistake?
Hi Dr. Laurence... How may i remove a word from the word list? Can I do that? I mean, I want to generate a list removing some words. I believe that was posible in previous versions.
Hi Luiz, do you happen to know how to do it already? Because I'm having the same problem
Hi. You can use the "Filter" tool in the global settings.
Hello Dr. Anthony, I've been trying to follow an older video of yours on how to work with lemmata, but the user interface changed since then, and I can no longer upload a lemma list as shown in the video. I can see in this video that there is still a greyed out "lemma list" functionality (even though i cant see it on my device). Where can i find the option to upload lemma lists and build word lists based on them on this Version on AntConc? Thanks a lot!
Hi. You need to load the lemma list when you build your corpus using the Raw files option in the Corpus Manager. See the help guide for instructions.
Is there a way to remove stop words from the results (a, of, the, etc.)?
Hello.
May I ask, what is the thing you type to search for the wildcard? Is it w^ or w' or w"?
I tried several times with my Antcon but I didn't get any results.
Looking forward to your answers.
Thank you.
Hi. It's w+
The wildcards are listed in the global settings.
Thank you very much for everything. How can i stop word repeating like the reference list?
I'm not sure what you mean. Can you rephrase your question?
Dear Professor Anthony,
I would like to compare a corpus to another corpus with the help of your software. I was wondering what exactly the numbers in Normfrequency relate to and how I could use this section to compare corpora, since you mentioned that Normfrequency would be practical to compare corpora of various sizes. Thank you so much in advance.
Best Regards
Hi! All the normed frequencies are comparable across corpora. A normed frequency is the frequency of the target item divided by the total number of tokens in the corpus as a whole, multiplied by a scaling factor (e.g. 1000) so that that we get a "freq. per 1000 words" or "freq. per million words" value which is comparable across corpora. I hope that helps.
is there a way to see what type of the word it is e.g noun verb etc
Yes! If you load in a part-of-speech (POS) tagged corpus, you can then view the POS in the results table.
Hello, your tutorial is very helpful! I was wondering if there is a possibility to exclude some words, e.g. articles, filler words and other grammatical words that by necessity will appear many times. Thank you very much if you answer me!
Yes. You can do this via the "Tool Filters" in the Global Settings. It can be applied to multiple tools (not just the Word list tool).
Hi Anthony. I'm trying to hide stop words in my corpus in the wordlist tool but I'm having a problem with it. These are the steps that I did (perhaps you can let me know what I did wrong):
Upload corpus > Global settings > Tool filters > Add stop words file (in txt format) > tick the cluster/n-gram etc checkbox > tick the hide words in file checkbox > click apply > run my wordlist (stop words are still there)
Thank you in advance for your help! :)
Hi, I just tested the latest version of AntConc here and it works fine. I suggest you start by just using one of the in-built corpora (e.g. The AmE06 learned corpus) and then load in a stop list with a single word inside (e.g. "the"). Make sure you can get this working and then try the stop list on your own files. Let me know what happens.
@@AntLabJPN Thank you for replying, Anthony. All is good now. Thanks again :)
@@tashataufek6748 Great!
Professor Anthony,
thanks for sharing such a useful video!
Can you please show how lemma file is applied? I read the tutorial and watched video, but still find it hard to use it. What I expect to see is, for example, 'look (30) - looks (2) - looked (3) - looking (10)', like this. But What i can see now is totally different from that.
Hi. Rather than using a lemma file, I would recommend that you simply POS tag your raw files with a POS tagger like TagAnt. If you really want to use a lemma file, you need to apply it when you build your corpus, via the Corpus Manager. See the help guide for instructions.
Hi Dr Anthony. Does the antconc have features lemmalist and stoplist? and how to use these features
Yes. You can apply a lemma list when you build your corpus in the corpus manager. The stop list is in the global settings (under filter) and applies across multiple tools.
@@AntLabJPN Thank you
Hello, I want to ask a question- in this version there is a possibility of "stoplist word" like in previous version (e.g. 3.5.8)? In the section "advances search" I can found the option "Search Query List", but I'm looking for the opposite function. Thank for your job!
Hi Monika. I think what you are looking for is the filter option that you can find in the global settings. You can now apply a stop list across multiple tools, which is why I moved the setting to the global settings.
Hi Laurence, is there a way to add a stop list? Is it similar to the previous AntConc version? I didn’t find the same options. Thanks!
Hi Bruna. Yes, you can do this. I've moved the function to the global settings as you can now apply a stop list in multiple tools. Check the "Tool Filters" option, where you can set a list of words to 'see' or 'hide' in the results.
@@AntLabJPN that’s great, thank you for your prompt reply!
Hello Dr. Anthony! I am currently trying to create a corpus about the language used in marine biology and I try to find the most frequent discipline-specific words using the Word list tool. Is there a way to eliminate function words from my search? Thank you very much in advance!
Hi! The easiest way is to load a list of function words in the global settings under tool filters and then choose to hide them in the results. A more advanced way is to compare your corpus against a general corpus and generate keywords. You could try using the 1 million word AmE06 or BE06 from the corpus manager as the reference. Watch the keyword tool video, which explains this.
Hi Laurence. I wanted to ask you something about the "Headword/Grouping list" option in the corpus manager. How is the format for the list supposed to look like? Also, how do I form queries using the headword/grouping terms (I know it's possible, but I don't know how to do it)? Sorry if this information is already in the manual, but I haven't been able to find it. Thank you.
Hi Mark,
These are great questions. I checked the help page and it doesn't explain what the format should be! How silly! The format is a simple TSV file of the headword and the family members, all separated by tab spaces.
cat cats cat
dog dogs dog
To search for headwords, the simplest way is with as follows:
*_*_cat (to search for all words with "cat" as the headword.
The whole topic of POS tagging and LEMMA searching in AntConc 4 is currently a little under documented. I'll try to address this as soon as possible.
By the way, I recommend you join the AntConc discussion group, where discussion can take place a little easier.
Hi thank you for the great app. I am struggling to search for a list of words ( very long list of words)
Hi. Try using the advanced search function, or you can use the filter function in the global settings.
Great tool, thank you! But "Save the current result" crashes on ubuntu 22.04. Copy works though.
Hi. Sorry for the slow response. A few others have reported on this, too. If you use the Windows version in Wine, you should find everything works fine. I'm now looking to create a Flatpak version, which should not have the same issues across different Linux distros.
Dear Professor Anthony,
Thank you so much for all your programs and your tutorials. I am an International law Phd candidate and I have been using Antcon for a few weeks. Lingustic not being my specialty and not being very confortable with programming I have some difficulties but I generally always find an answer in your videos or website. I have seen in another video that you can research words from a word list. I made one with a total of 282 verbs (docx and rtf format) and I would like to upload it to confront it with my corpus (tagged and untagged). I cannot seem to find the way to do it in Antcon 4.1.4, could you please help me ?
Thank you so much in advance,
Respectfully
Are you wanting to search for words in your list or just filter the corpus wordlist results based on your list?
@@AntLabJPN filter the corpus results through my list, please.
@@lisaaerts7464 For this, you can load your list in the Global Settngs-> Tool Filters option. I hope that helps!
Hi Laurence, I am Englist philology student. I need your assistance! Please explain how to get top 10 lists of nouns, adjectives, verbs. Thank you in advance.
Hi,
This sounds a bit like a homework project. But, anyway, here are some hints.
1. POS tag the text data (e.g. using my TagAnt tool)
2. Load the POS tagged data in AntConc using the "simple_word_tag_headword" indexer.
3. Search for the word types preceded with a wildcard (e.g. *_NOUNTAG)
I hope that helps!
Laurence.
@@AntLabJPN Dear Laurence, yes, I have research paper. Thank you for advise. Could you please explain what is POS?
If I correctly understood I should upload TagAnt for part of speech importing. Do you have youtube link with TagAnt? Thank you for your assistance, because you are my last chance.
@@play-mq7oq POS means "part of speech". If you search for TagAnt, the first hit will be my tool.
www.laurenceanthony.net/software/tagant/
@@AntLabJPN Dear Laurence, I uploaded plain text with POS into AntConc4.2.0, however AntConc counts a noun as a word, and it has a high frequency. Inserted query *_NOUN doesn't work. I used word+pos, because TagAnt 2.0.5 doesn't have "simple_word_tag_headword". Could you advise where is my mistake?
@@play-mq7oq You need to load "word+postag" data into AntConc with the AntConc "simple_word_tag_headword" indexer.
thank you Dr. Anthony.
You're very welcome! I'm hope my tools are serving you well.
thanks a lot for the effictive Video
Thanks!