This FAQ is now obsolete. Please check out the new version of Wordalizer.
Want to improve your mastery of Wordalizer and become the King of InDesign Word Clouds? Well! This FAQ reveals the obscure features of the script, provides advanced tips and tricks and helps you to understand limitations and specific issues.
• I currently use InDesign CS4 (Mac OS X) but I plan to upgrade to CS5 soon. If I purchase Wordalizer PRO today, will my license work on ID CS5?
[Wordalizer Pro tested in InDesign CS5, French UI. Thanks to Stéphane Baril from Adobe France.]
• What means the option labeled “Parse Case” in the Parser panel?
[BOTH VERSIONS] The “Parse Case” box is checked by default. It tells Wordalizer to detect and gather the case variants of a same word during the parsing process. For example, if the text contains several occurrences of
INSTALLATION, the parser records
installation as the prototypal form and counts all occurrences under this entry. Occasionally you may need to distinguish case variants (e.g.
bill). Then uncheck “Parse Case”.
Note that the “Parse Case” option is disabled when you check “Uppercase” in the Characters panel because all words are then converted into their uppercase form:
• What is the difference between “Previous word list” and “Active word cloud” in the Source panel?
[PRO VERSION] When you run Wordalizer PRO repeatedly during the same InDesign session, the script keeps in memory your current preferences, including the word list it has previously built. This allows you to skip the parsing process if you work with the same text source, and to fine tune the cloud design from the latest settings. In addition, each time a word cloud is generated, Wordalizer permanently saves its own settings in the resulting InDesign document. So for example you can backup a cloud on your hard disk, quit InDesign, then work later from that document settings by reopening the file —which makes it the “Active word cloud.” Therefore, there are circumstances in which Wordalizer can apply either the preferences of the current session, or those of the active word cloud. In this case, you can choose the desired word list in the Source panel by checking either “Previous word list”, or “Active word cloud”:
When appropriate, Wordalizer also gives you the option to apply the active word cloud settings rather than the session preferences:
• What means the option labeled “Allow Embedded Words” in the Cloud Design panel?
[BOTH VERSIONS] When “Allow Embedded Words” is checked, Wordalizer is allowed to incorporate words within other words, provided their bounds do not intersect. To achieve this effect, use a heavy solid font and make sure that your lexicon contains low weighted words:
• Why is the Parsel panel disabled when I select “Active word cloud” ?
[PRO VERSION] Since the parser's role is to extract a list of lexical weighted words from a source text, it has nothing to do if the list is already provided. The option labeled “Active word cloud” causes the script to reuse the word list —the terms and their respective weights— associated with the existing word cloud.
Advanced Tips and Tricks
• How to create a word cloud from a web page?
[BOTH VERSIONS] First, open the page in your browser. Select and copy (
Cmd C) the text to parse. Open InDesign and run Wordalizer from the Scripts panel. In the Source panel, check the Clipboard radio button. If you are using the PRO version, you can also check the “Edit Weighted List” box in order to manage the words before generating the cloud. Tune the other settings for your purpose and press OK.
• Can I directly provide a “weighted word list” to Wordalizer?
[BOTH VERSIONS] Yes, both versions offer the ability to supply a short weighted list rather than a text. Note that this hidden feature will not work if the entire list contains more than 1024 characters. The easiest way to provide your data is to create a text frame in InDesign, then to enter the weighted words in the following format:
word1 : weight1 word2 : weight2 word3 : weight3 etc.
weight[i] is a positive number.
Here is a simple example:
When Wordalizer detects that the source contains a weighted list, it automatically affects the weights to the corresponding terms and skips the parsing process. Note that this method allows you to inject spaces or exotic characters in your entries. Don't forget to select the appropriate source in the Source panel. Weighted word list can also be provided from the clipboard or plain text file.
• I appreciate the “Rarest Words” feature, but it consumes a longer processing time without providing a sufficient variety of word weights. How to fix this?
[PRO VERSION] The rarest words tend to share the same weight because they appear once or twice in the text. Consequently, the resulting list does not contain a significant gap between the word weights, which on the one hand bogs down the placement routine, on the other hand reduces the ‘contrast’ of the final word cloud. As a workaround, check the “Edit Weighted List” box in the Source panel and consider to manually change a few weights in the list. For example, if all entries are rated 100, apply randomly some 60, 70, 80, and 90 values. The resulting word cloud will surely look better, although it becomes statistically wrong!
• How to recover the word list of an existing word cloud?
[PRO VERSION] Open the word cloud document (or activate it), and run Wordalizer. Make sure that “Active Word Cloud” is selected in the Source panel and check the “Edit Weighted List” box. Press OK. You can then select and copy the entire list from the list editor. (Press Cancel to avoid building a new word cloud.)
• Will the word cloud be available as editable text (and not vectors)?
[BOTH VERSIONS] No, it won't. The current version of Wordalizer does not produce editable text frames. We might consider to add a vector-to-text feature in a future release, but it is really not easy to perform since the main algorithm is purely based on vectors!
• Is there a word length limit?
[BOTH VERSIONS] See below: “I occasionally get an error #55...” in the KNOWN ISSUES section.
• Since Wordalizer TRY doesn't allow to change the source language, what is the default available language?
[TRY VERSION] It is important to distinguish between the user interface language and the parser language(s) available in the Parser panel. For the time being Wordalizer provides two UI languages (English, French) and supports six Parser languages (English, French, German, Spanish, Portuguese, and Russian). The script UI dialog is displayed in English unless you run Wordalizer from a French installation of InDesign. In other respects, the available Parser language (which is locked in the TRY version) fits your InDesign locale if the language is supported. Otherwise the default selected language is English. For instance, Spanish InDesign users get a Spanish parser although the dialog box is displayed in English. But Finnish users get both the interface and the parser locked for the English language.
• Why does Wordalizer regard ‘dog’ and ‘dogs’ as two different words?
[BOTH VERSIONS] The text parser is only based on morphological routines. It scans the characters of the supplied text and counts the occurrences of each extracted word. If the “Skip Usual Words” box is checked, it also removes many blacklisted terms, but it does not implement a stemming procedure that would unite the syntactic variants of a word —such as plural, verbal forms, etc. If you use the PRO version, you can manually regroup the words as required by editing the weighted list.
• Will Wordalizer join two words as a single entry if I use a non-breaking space between the words?
[BOTH VERSIONS] With the exceptions of hyphens, the text parser ignores most of the ‘non-alphabetic’ characters (which depend on the selected language). That's to say that spaces, non-breaking spaces, punctuation, bullets, and any similar characters are regarded as word separators. However the PRO version allows you to insert extra characters through the word list editor, which always supersedes the parser restrictions.
• Why couldn't Wordalizer manage more than 100 words?
[BOTH VERSIONS] This limitation is empirical. It follows from the fact that Wordalizer requires a lot of computing resources. However, this word counter will probably be increased in future releases.
• Is there a way to add my own color schemes in the Theme list?
[BOTH VERSIONS] Not yet! When it generates a word cloud for the theme you've selected, Wordalizer adds the required colors in the Swatches panel:
Of course you can control the colors of the word cloud at this level. Now if you really like your customized theme and want to reuse it, an option is to save the swatches in a .ase file so you can easily restore your set later, then apply the Steve Wareham's Swatch Switcher script on future word clouds built with the same theme. (Anyway, I concede that this workaround is not comfortable. . .)
• I occasionally get an error #55, that the method ‘path’ is not supported, and Wordalizer stops!
[BOTH VERSIONS] This issue as been reported by several users and we are still working to address it. The bug appears when the word list contains an item having about 25 characters or more. It does not seem to be platform-specific. If you encounter this problem with
Wordalizer PRO and need to manage long words, please contact our tech support. We will send you a temporary patch.
• InDesign becomes instable or crashes when I run Wordalizer repeatedly.
• My source text contains some letters with dot above and/or dot below. The font I select supports those characters but they don't appear in the word cloud.
[BOTH VERSIONS] Wordalizer supports several character sets which depend on the language you select in the Parser panel. The Latin languages have been adjusted to allow most of the characters used in their typical lexical space. They contain the Unicode blocks referred to as Basic Latin, Latin1 Supplement, Latin Extended-A, and Latin Extended-B (up to
U+02AF). But they do not include additional blocks such a “Latin Extended Additional” (from
U+1EFF). Adding more Unicode ranges has a performance cost, so I'm looking at a solution that optionally extends the character set.
• For more details about Wordalizer, see the main page: