Exclude segments based on a stoplist
Goal
Filter out segments based on a stoplist.
Prerequisites
Some text has been imported into Orange Textable (see Cookbook: Text input) and segmented into words (see Cookbook: Segment text in smaller units).
Ingredients
Widget        Quantity
Text Field    2
Segment       2
Intersect     1
Procedure

Figure 1: Exclude segments based on a stoplist with instances of Text Field, Segment and Intersect
Create an instance of Text Field and paste into it the stoplist you want to use.
Follow the instructions in Cookbook: Segment text in smaller units to segment the stoplist into words (the resulting segmentation is called stopwords here).
Create an instance of Intersect.
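Drag and drop from the output (right-hand side) of the widget that emits the segmentation to be filtered, here Segment (words), to the input (left-hand side) of Intersect. Connect the widget that emits the stoplist segmentation (here: stopwords) to Intersect in the same way, so that both segmentations are available to it.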
Double-click on the icon of Intersect to open its interface.
In the Intersect section, choose Mode: Exclude.
In the Source segmentation field, choose the segmentation to be filtered (here: words); in the Filter segmentation field, choose the segmentation containing the stopwords (here: stopwords).
Click the Send button or tick the Send automatically checkbox.
The filtered segmentation, which no longer contains the segments found in the stoplist, is then available at the output of Intersect; to display or export it, see Cookbook: Text output.
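For readers who find code easier to follow, the effect of this procedure can be sketched in plain Python. This is only a conceptual illustration, not Orange Textable's own implementation or API: the function names tokenize and exclude are hypothetical, the regular expression is a simple stand-in for the Segment widget, and the comparison assumes exact, case-sensitive matching of segment content.

```python
import re


def tokenize(text):
    """Split text into word tokens (a simple stand-in for the Segment widget)."""
    return re.findall(r"\w+", text)


def exclude(source, stoplist):
    """Keep the source tokens whose content does not appear in the stoplist."""
    stopwords = set(stoplist)  # set membership tests are fast
    return [token for token in source if token not in stopwords]


# Hypothetical sample data: the text to filter and the pasted stoplist.
words = tokenize("the cat sat on the mat")
stopwords = tokenize("the on a an")

filtered = exclude(words, stopwords)
print(filtered)  # ['cat', 'sat', 'mat']
```

Because the sketch compares content exactly, a token such as "The" would not be removed by the stopword "the"; lowercasing both lists beforehand is one way to handle this.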
Comment
Stopword lists for various languages can be found here.