Encoded Summarization: Summarizing Documents into Continuous Vector Space for Legal Case Retrieval
Vu TRAN, Minh Le NGUYEN, Satoshi TOJO and Ken SATOH, "Encoded Summarization: Summarizing Documents into Continuous Vector Space for Legal Case Retrieval", (2020) 28-4 Artificial Intelligence and Law 441, DOI: 10.1007/s10506-020-09262-4.
We present a method for legal case retrieval that encodes documents by summarizing them into a continuous vector space, using a phrase scoring framework built on deep neural networks. We also explore the benefits of combining lexical features with latent features generated by neural networks. Our experiments show that the two kinds of features complement each other and improve retrieval performance. Furthermore, our experimental results suggest that case summarization matters in two respects: using the provided summaries and performing encoded summarization. Our approach achieved F1 scores of 65.6% and 57.6% on the experimental datasets of the legal case retrieval tasks.
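The idea of letting lexical and latent features complement each other can be illustrated with a minimal sketch. The abstract does not say how the two feature types are combined, so the weighted sum below is an assumption, not the authors' method. The function names, the `weight` parameter, and the toy two-dimensional vectors are all hypothetical; in the paper the latent vectors would come from the neural phrase scoring framework, whereas here they are written by hand.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def lexical_similarity(query_text, doc_text):
    """Term-frequency cosine over whitespace tokens (a simple lexical feature)."""
    return cosine(Counter(query_text.lower().split()),
                  Counter(doc_text.lower().split()))


def latent_similarity(query_vec, doc_vec):
    """Cosine between dense vectors (stand-ins for neural document encodings)."""
    return cosine(dict(enumerate(query_vec)), dict(enumerate(doc_vec)))


def combined_score(lexical, latent, weight=0.5):
    """Assumed combination: a weighted sum of the two similarity scores."""
    return weight * lexical + (1.0 - weight) * latent


def rank(query, candidates, weight=0.5):
    """Return (candidate id, score) pairs sorted from best to worst."""
    scored = []
    for cand in candidates:
        lex = lexical_similarity(query["text"], cand["text"])
        lat = latent_similarity(query["vec"], cand["vec"])
        scored.append((cand["id"], combined_score(lex, lat, weight)))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy data: hand-written vectors, not real model output.
query = {"text": "breach of contract damages", "vec": [1.0, 0.0]}
candidates = [
    {"id": "A", "text": "breach of contract damages", "vec": [0.0, 1.0]},
    {"id": "B", "text": "unrelated immigration appeal", "vec": [1.0, 0.0]},
    {"id": "C", "text": "contract damages awarded", "vec": [1.0, 0.0]},
]
ranking = rank(query, candidates)
```

In this toy setup, candidate A matches only lexically and candidate B only in the latent space, while candidate C matches partly on both and therefore ranks first. That is the sense in which the two signals can complement each other.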