IndexMatic is very good at finding recurring text patterns in a document (brands, products, proper names, URLs…) and Wordalizer is very good at making customizable word clouds based on item frequencies. Here is how you can branch the former to the latter…
Let's first open our InDesign document. In my example I plan to extract all the proper names that specifically occur in the footnotes.
Running IndexMatic, I select the footnote paragraph style and go into the Single query mode. Here I will use the usual query
/\m\w+/ in order to grab all words starting with a capital letter. Then, instead of building an index, I trigger a hit report:
In the Hit Report dialog, let's make sure the options Display stats and Sort by frequency are checked:
IndexMatic might generate a large number of results, including unwanted entries (this depend on your document, your query, and your goal.) Edit the data in your text editor so that only the words and frequencies you want to visualize remain in the set:
Going back to InDesign, create a temporary document and fill a text frame with your word list. Note that words and frequencies are still formatted in a way that Wordalizer cannot properly parse:
so we run a quick GREP command to replace multiple spaces by a colon punctuation mark:
Before you go further, make sure that the option Detect word list is turned on in Wordalizer's preference panel. (To check and/or change your preferences, run the script and click the gear icon.)
Then, having your word list in the active document, run Wordalizer. In the Source panel, select the document icon and disable the language flag:
When the data are loaded in Wordalizer, adjust your settings as needed. If necessary, click the Boost low entry weights button to make low frequencies more perceptible:
Finally, create the word cloud, and enjoy the result!