Bhargav Srinivasa Desikan
Natural language processing models learn word representations based on the distributional hypothesis, which asserts that word context (e.g., co-occurrence) correlates with meaning. We propose that n-grams composed of random character sequences, or garble, ...
ASSOC COMPUTATIONAL LINGUISTICS-ACL2022