Run a word frequency query

 

 


Understand Word Frequency queries

Use Word Frequency queries to list the most frequently occurring words in your sources. You can choose to search only in the textual content of your sources, in the annotations or both.

You can select the source content you want to search, by selecting sources, nodes, sets, folders or search folders.

Before you run a Word Frequency query, make sure the text search language is set to the language of your source materials—refer to Set the text search language and stop words for more information.

You could use a Word Frequency query to

NOTE  You cannot use Word Frequency queries to search for words in framework matrix summaries.


Run a Word Frequency query

  1. On the Explore tab, in the Queries group, click New Query, and then click Word Frequency.

The Word Frequency Query dialog box opens.

  2. Adjust the Finding matches slider if you want to find concepts rather than words. By default, the slider is set to find exact words only; refer to Understanding text match settings for more information.

  3. In the Search in box, select whether you want to search in Text, Annotations or both.

  4. To change the scope of the query:

  5. Under Display Words, you can choose

  6. (Optional) Enter a With minimum length to exclude short words from the results. For example, enter 7 to display only words with seven or more letters.

  7. Click Run.
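The effect of the minimum length setting can be illustrated outside NVivo with a short Python sketch. This is not NVivo's implementation: the function name, the tiny stop word set and the top_n parameter are assumptions made for the example, and only exact matches are counted.

```python
import re
from collections import Counter

# Hypothetical stop word list for this example; not NVivo's default list.
STOP_WORDS = {"the", "and", "of", "in", "a", "to", "is"}

def word_frequency(text, min_length=1, top_n=1000, stop_words=STOP_WORDS):
    """Count exact-match words, skipping stop words and short words."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(
        w for w in words
        if w not in stop_words and len(w) >= min_length
    )
    # Most frequent words first, limited to top_n entries.
    return counts.most_common(top_n)

sample = ("Pollution in the river is rising and the pollution "
          "of the coastal wetlands is a concern")
for word, count in word_frequency(sample, min_length=7):
    print(word, len(word), count)
```

With min_length=7, words such as "river" and "rising" are dropped, which mirrors entering 7 in the minimum length box.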


Understand the results

When you run a Word Frequency query, the results are displayed in Detail View. There are four tabs on the right: Summary, Tag Cloud, Tree Map and Cluster Analysis. You can change which tab is displayed by default; refer to the display options in Set application options for more information.

Summary tab

[Screenshot: the Summary tab, with numbered callouts 1 to 5 for the columns described below]

1  The most frequently occurring words excluding any stop words. If you adjusted the Finding matches slider to return similar words, the most frequently occurring word from the group is displayed in this column.

2  Length—the number of letters or characters in the word.

3  Count—the number of times that the word occurs within the project items searched. If you adjusted the Finding matches slider to include similar words, this count is the total for all the similar words.

4  Weighted Percentage—the frequency of the word relative to the total words counted. If you adjusted the Finding matches slider to include similar words, a word may be part of more than one group of similar words. The weighted percentage assigns a portion of the word's frequency to each group so that the overall total does not exceed 100%.

5  Similar Words—other words that have been included as a result of adjusting the Finding matches slider—for example, if you include words with the same stem, then pollutants, pollution, and polluted would be grouped together. This column is not available if you use 'Exact match only'.
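To make the relationship between Count and Weighted Percentage concrete, here is a minimal Python sketch. It rests on two assumptions that the text above does not confirm: the groups of similar words are supplied by hand rather than derived by stemming, and a word that belongs to several groups contributes an equal share of its count to each. NVivo's actual weighting may differ.

```python
from collections import Counter

def summary_rows(counts, groups):
    """Build (word, length, count, weighted %, similar words) rows.

    counts: Counter mapping each word to its raw count.
    groups: list of sets; each set is one group of similar words.
    """
    total = sum(counts.values())
    # Number of groups each word belongs to, so its count can be split.
    membership = Counter(w for g in groups for w in g)
    rows = []
    for group in groups:
        # Display the most frequent member of the group.
        head = max(sorted(group), key=lambda w: counts[w])
        count = sum(counts[w] for w in group)
        # Assumption: a shared word gives an equal share to each group.
        weighted = sum(counts[w] / membership[w] for w in group)
        pct = round(100 * weighted / total, 2)
        rows.append((head, len(head), count, pct, sorted(group)))
    return sorted(rows, key=lambda r: r[2], reverse=True)

counts = Counter({"pollution": 4, "polluted": 2,
                  "contamination": 2, "contaminated": 2})
groups = [
    {"pollution", "polluted", "contamination"},  # hypothetical group
    {"contamination", "contaminated"},           # shares "contamination"
]
for row in summary_rows(counts, groups):
    print(row)
```

In this example the Count column adds up to 12 because "contamination" is counted in both groups, while the weighted percentages add up to 100.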

Tag Cloud tab

This tab displays up to 100 words alphabetically in varying font sizes, where frequently occurring words are in larger fonts.
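As a rough illustration of how a tag cloud maps frequency to font size, the following Python sketch keeps the most frequent words, scales each count linearly to a point size, and sorts the result alphabetically. The linear scaling and the 10 to 36 point range are assumptions for the example; the scaling NVivo uses is not stated here.

```python
def tag_cloud(counts, max_words=100, min_pt=10, max_pt=36):
    """Return (word, font size) pairs, alphabetical, sized by frequency."""
    top = sorted(counts.items(), key=lambda kv: kv[1], reverse=True)[:max_words]
    if not top:
        return []
    lo = min(c for _, c in top)
    hi = max(c for _, c in top)
    span = (hi - lo) or 1  # avoid dividing by zero when all counts are equal
    sized = [
        (word, round(min_pt + (count - lo) * (max_pt - min_pt) / span))
        for word, count in top
    ]
    return sorted(sized)  # alphabetical order by word

print(tag_cloud({"water": 3, "pollution": 9, "river": 6}))
```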


Tree Map tab

The Tree Map tab displays up to 100 words as a series of rectangles, where frequently occurring words are in larger rectangles.


Cluster Analysis tab

The Cluster Analysis tab displays up to 100 words as a horizontal dendrogram, where words that co-occur are clustered together.


When you click on the cluster analysis diagram, the Cluster Analysis tab (on the ribbon) becomes available. You can use the commands on this ribbon tab to change the appearance or content of the diagram.

For more information, refer to Change the appearance or content of a cluster analysis diagram.


See all the references for a selected word

When you run a Word Frequency query, a preview node is created for each word—this lets you see all references to the word. To open a preview node:

In the preview node, you see each occurrence of the selected keyword in context.

The context (the text around the word) is displayed in grey—by default it is a 'narrow' context. To expand the context for a selected reference, on the View tab, in the Detail View group, click Node and choose the coding context.

You can also change the definition of 'narrow' to show more or fewer words on each side of the selected word; refer to Narrow and broad and custom reach settings for more information.
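The idea of showing a fixed number of words on each side of a keyword can be sketched in Python as follows. The function name and the reach parameter are hypothetical, and the default of 5 words is an assumption for the example rather than NVivo's documented 'narrow' setting.

```python
import re

def keyword_in_context(text, keyword, reach=5):
    """Return each occurrence of keyword with up to `reach` words either side."""
    words = re.findall(r"\S+", text)
    hits = []
    for i, w in enumerate(words):
        # Compare on word characters only so punctuation does not block a match.
        if re.sub(r"\W", "", w).lower() == keyword.lower():
            before = " ".join(words[max(0, i - reach):i])
            after = " ".join(words[i + 1:i + 1 + reach])
            hits.append((before, w, after))
    return hits

text = "The river carries pollution downstream. Local pollution, however, has fallen."
for before, word, after in keyword_in_context(text, "pollution", reach=2):
    print(f"{before} [{word}] {after}")
```

Raising reach widens the context shown around each reference, which is comparable to choosing a broader coding context.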


Make nodes from the results of a Word Frequency query

You can create a node that includes all the references to a word you select in the Word Frequency query results.

  1. Run a Word Frequency query.

  2. Select the word you want to use to create a node.

  3. On the Create tab, in the Items group, click Create As and select Create As Node.

The Select Location dialog box opens.

  4. Select a location and name the node.

  5. Click OK.

NOTE You can also right-click on a word to see the shortcut menu, and click Create As Node.


You can run a Text Search query for a selected word in the Word Frequency query results.

  1. On the Explore tab, in the Queries group, click Other Actions, and then click Run Text Search Query.

The Text Search Query dialog box opens.

  2. (Optional) Change the Text Search Criteria or Query Options.

  3. Click Run.

Refer to Run a Text Search query for more information.


When determining the frequency of words, NVivo applies the following rules:


Exclude particular words when running Word Frequency queries

Word Frequency queries do not include 'stop words'. By default, these are less significant words like conjunctions or prepositions. You can view and edit the list of stop words; refer to Set the text search language and stop words for more information.

You can add a word displayed in your query results to the stop words list: select the word you want to exclude from the query results, then click Add to Stop Words List, in the Stop Words group on the Analyze tab. The words you add to the stop words list will be excluded the next time you run a Word Frequency or Text Search query.

NOTE  In server projects, only Project Owners can add words to the stop word list—refer to About teamwork in a server project for more information.
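The workflow of adding a result word to the stop words list and then running the query again can be sketched in Python. The function name and the small stop word set are hypothetical and stand in for the project's stop words list; this is an illustration of the behaviour described above, not NVivo code.

```python
from collections import Counter

def run_query(words, stop_words):
    """Count words, leaving out anything on the stop word list."""
    return Counter(w.lower() for w in words if w.lower() not in stop_words)

# Hypothetical starting stop word list for this example.
stop_words = {"the", "and", "of"}
words = "the council said the river and the council agreed".split()

first = run_query(words, stop_words)
print(first.most_common())

# Add a word from the results to the stop list, then run the query again.
stop_words.add("said")
second = run_query(words, stop_words)
print(second.most_common())
```

The added word appears in the first set of results but not in the second, because the stop list only takes effect on the next run.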
