Skip to content

Latest commit

 

History

History
10 lines (7 loc) · 626 Bytes

File metadata and controls

10 lines (7 loc) · 626 Bytes

Machine-Learning-In-Law

This project explores the use of ML in the legal sector.

  1. A legal document is provided in pdf format. Usually multiple pages.
  2. The function extracts the texts in the legal document (it can be modified to extract images too but this is beyond the scope of this work)
  3. A series of fucntions utilizes list comprehensions and regex to clean the texts.
  4. (a.) Topic modelling is carried out on the clean texts to extract subject-matter and themes of the document using LatentDirichletAllocation (LDA). (b.) A soft-text summarization is also carried out, supported by wordcloud display.