The component of ignoring two intervening characters is less mysterious to me. For example, a numbered list like “1. first_token 2. second_token …” would need this pattern. I am wondering mostly why the specific map from b’xa1′-b’xba’ to a-z is learned.
The component of ignoring two intervening characters is less mysterious to me. For example, a numbered list like “1. first_token 2. second_token …” would need this pattern. I am wondering mostly why the specific map from b’xa1′-b’xba’ to a-z is learned.