...
- sudo apt update
- gpg
- sudo apt install gnupg
- java
wget -qO - https://adoptopenjdk.jfrog.io/adoptopenjdk/api/gpg/key/public | sudo apt-key add -
sudo apt-get install -y software-properties-common
sudo add-apt-repository --yes https://adoptopenjdk.jfrog.io/adoptopenjdk/deb/
sudo apt-get install adoptopenjdk-8-hotspot
sudo apt-get install adoptopenjdk-11-hotspot
sudo apt-get install adoptopenjdk-14-hotspot
sudo apt-get install fontconfig (https://github.com/AdoptOpenJDK/openjdk-build/issues/693 via Dominik Stadler)
sudo apt install ttf-dejavu (same as above)
- sudo apt-get install groovy
- sudo apt-get install maven
- sudo apt-get install subversion
- sudo apt-get install git
- sudo apt-get install file
- installed docker following: https://docs.docker.com/engine/install/ubuntu/
Datasette
I semi-manually created a sqlite db that is a combination of tika-eval's Profile mode and three other tables: a complete file list, a list of mimes from 'file' and a list of mimes from Tika. In the future, I think we should keep tika-eval's profile db separate from the mimes db.
To reverse proxy /datasette:
ProxyPreserveHost On
ProxyPass /datasette http://0.0.0.0:8001
ProxyPassReverse /datasette http://0.0.0.0:8001
The .db is in /data1/publish. cd to that directory and then: docker run -d -p 8001:8001 -v `pwd`:/mnt datasetteproject/datasette datasette -p 8001 -h 0.0.0.0 /mnt/corpora-metadata.db --config sql_time_limit_ms:60000 --config base_url:/datasette/
Everything below here needs to be updated for Ubuntu
...