Fig. 2

Filtering steps from EHR documents related to early psychosis intervention services. First, we retain documents with length and average line length (avg_line_length) greater than a certain threshold. Then, we keep documents including at least one psychosis symptom keyword (from a list of predefined keywords). Finally, we retain documents containing more than five time expressions (as automatically extracted by a rule-based system)