Making the legislative process in Puerto Rico more transparent
http://www.tucamarapr.org/dnncamara/web/ActividadLegislativa/tramitelegislativo.aspx?measureid=XXXX
The page contents are to be retrieved and stored as JSON files for later processing, saved as documents/{measure_name}.es.json
measure {
Measure Name :: string
Date Filed :: date
Authors :: string[]
Heading :: string
History :: History[]
}
History {
Date :: date
Description :: string
Document :: string (url)
}
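A minimal sketch of how the schema above and the source URL could map onto Rust types. The field names, the use of plain strings for dates (the date format is not specified here), and the helper names `measure_url` and `spanish_json_path` are assumptions for illustration, not the repository's actual code.

```rust
use std::path::PathBuf;

/// One step in a measure's legislative history.
#[derive(Debug, Clone, PartialEq)]
pub struct History {
    /// Kept as a string because the source date format is not specified (assumption).
    pub date: String,
    pub description: String,
    /// URL of the attached document.
    pub document: String,
}

/// A measure, mirroring the schema above.
#[derive(Debug, Clone, PartialEq)]
pub struct Measure {
    pub measure_name: String,
    /// Kept as a string for the same reason as `History::date`.
    pub date_filed: String,
    pub authors: Vec<String>,
    pub heading: String,
    pub history: Vec<History>,
}

/// Builds the detail-page URL by filling in the `measureid` query parameter.
/// Hypothetical helper: the id is passed through as text, unvalidated.
pub fn measure_url(measure_id: &str) -> String {
    format!(
        "http://www.tucamarapr.org/dnncamara/web/ActividadLegislativa/tramitelegislativo.aspx?measureid={}",
        measure_id
    )
}

/// Path of the Spanish JSON file: documents/{measure_name}.es.json
pub fn spanish_json_path(measure: &Measure) -> PathBuf {
    PathBuf::from("documents").join(format!("{}.es.json", measure.measure_name))
}
```

For example, `measure_url("1234")` yields the URL above with `measureid=1234`, and a measure named `PC0001` (a made-up name) would be stored at `documents/PC0001.es.json`.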
Save the linked documents to folders as documents/{measure_name}/{history_date}.{history_description}.pdf
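The folder layout for history documents could be built along these lines. This is a sketch only: `history_pdf_path` and `sanitize` are hypothetical names, and replacing characters that are unsafe in file names with `_` is an assumption, since dates and descriptions may contain slashes or colons.

```rust
use std::path::PathBuf;

/// Replaces characters that are unsafe in file names with '_'.
/// This rule is an assumption, not something the README specifies.
fn sanitize(component: &str) -> String {
    component
        .trim()
        .chars()
        .map(|c| match c {
            '/' | '\\' | ':' | '*' | '?' | '"' | '<' | '>' | '|' => '_',
            other => other,
        })
        .collect()
}

/// Builds documents/{measure_name}/{history_date}.{history_description}.pdf
pub fn history_pdf_path(
    measure_name: &str,
    history_date: &str,
    history_description: &str,
) -> PathBuf {
    PathBuf::from("documents")
        .join(sanitize(measure_name))
        .join(format!(
            "{}.{}.pdf",
            sanitize(history_date),
            sanitize(history_description)
        ))
}
```

With made-up inputs, `history_pdf_path("PC0001", "01/04/2021", "Radicado")` gives `documents/PC0001/01_04_2021.Radicado.pdf`.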
Translate documents/{measure_name}.es.json into documents/{measure_name}.en.json
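The file-name side of this step can be sketched as a small path mapping. `english_path` is a hypothetical helper; the translation itself (done with rust-bert in this project) is deliberately left out.

```rust
use std::path::{Path, PathBuf};

/// Maps documents/{measure_name}.es.json to documents/{measure_name}.en.json.
/// Returns None when the file name does not end in ".es.json".
pub fn english_path(spanish_path: &Path) -> Option<PathBuf> {
    let file_name = spanish_path.file_name()?.to_str()?;
    let stem = file_name.strip_suffix(".es.json")?;
    Some(spanish_path.with_file_name(format!("{}.en.json", stem)))
}
```

Returning `None` for anything that is not an `.es.json` file makes it easy to skip unrelated files when walking the documents folder.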
Index page for es/en, filterable by measure id, heading substring, or authors
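One way the index filter could work, as a sketch. The `IndexEntry` shape, the single free-text query, and the case-insensitive matching are all assumptions; the README only names the three criteria.

```rust
/// One row of the index page (hypothetical shape).
#[derive(Debug, Clone, PartialEq)]
pub struct IndexEntry {
    pub measure_id: String,
    pub heading: String,
    pub authors: Vec<String>,
}

/// Returns the entries that match the query on any of three criteria:
/// exact measure id, heading substring, or author substring.
/// Matching is case-insensitive; an empty query matches everything.
pub fn filter_entries<'a>(entries: &'a [IndexEntry], query: &str) -> Vec<&'a IndexEntry> {
    let needle = query.trim().to_lowercase();
    entries
        .iter()
        .filter(|e| {
            needle.is_empty()
                || e.measure_id.to_lowercase() == needle
                || e.heading.to_lowercase().contains(&needle)
                || e.authors.iter().any(|a| a.to_lowercase().contains(&needle))
        })
        .collect()
}
```

The same function can serve both languages by running it over the entries loaded from the `.es.json` or the `.en.json` files.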
Convert the JSON files into static HTML files for es/en
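A rough sketch of the rendering step, assuming the JSON has already been parsed into plain values. The page layout, the function names `render_measure_html` and `escape_html`, and the choice of fields shown are illustrative assumptions; the only firm requirement is that scraped text is escaped before it is placed in HTML.

```rust
/// Escapes the five characters that are significant in HTML text and attributes.
fn escape_html(text: &str) -> String {
    let mut out = String::with_capacity(text.len());
    for c in text.chars() {
        match c {
            '&' => out.push_str("&amp;"),
            '<' => out.push_str("&lt;"),
            '>' => out.push_str("&gt;"),
            '"' => out.push_str("&quot;"),
            '\'' => out.push_str("&#39;"),
            other => out.push(other),
        }
    }
    out
}

/// Renders one measure as a standalone HTML page for a language code ("es" or "en").
pub fn render_measure_html(
    lang: &str,
    measure_name: &str,
    heading: &str,
    authors: &[&str],
) -> String {
    // Each author becomes one escaped list item.
    let author_items: String = authors
        .iter()
        .map(|a| format!("<li>{}</li>", escape_html(a)))
        .collect();
    format!(
        "<!DOCTYPE html>\n<html lang=\"{lang}\">\n<head><meta charset=\"utf-8\"><title>{name}</title></head>\n<body>\n<h1>{name}</h1>\n<p>{heading}</p>\n<ul>{authors}</ul>\n</body>\n</html>\n",
        lang = escape_html(lang),
        name = escape_html(measure_name),
        heading = escape_html(heading),
        authors = author_items,
    )
}
```

Calling it once with `"es"` and once with `"en"` on the matching JSON contents would produce the two static pages for a measure.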
Translation currently uses a local build of rust-bert.
To get it working, I updated OpenSSL to 3.0 via the experimental Ubuntu repo.
I downloaded libtorch from https://download.pytorch.org/libtorch/cu113/libtorch-shared-with-deps-1.11.0%2Bcu113.zip (libtorch-cxx11-abi-shared-with-deps-1.11.0+cu113.zip), extracted it locally, and pointed LIBTORCH at it, following the rust-bert instructions.
I then cloned rust-bert, built it with cargo build, and pointed my translate Cargo.toml to the extracted directory.
Translation currently runs on the CPU. My GPU is too small; otherwise it would be better to run it on the GPU.