build: 💚 fix --fetchDocuments option & textes loi parsing
- Command line does not crash anymore with the
--fetchDocuments
argument - Add more filter on document uid & type before extracting textes, avoiding
MION*
type - Add new
Dataset.clean()
method - Make some functions in
clean_reorganized_data.ts
async