Activeloop Hub, Deep Lake
Created: 15 Dec 2022, 11:03 AM | Modified: =dateformat(this.file.mtime,"dd MMM yyyy, hh:mm a")
Tags: knowledge, tools
[Summary paper]
An Overview of the Data-Loader Landscape: Comparative Performance Analysis
https://arxiv.org/pdf/2209.13705.pdf
- Interestingly, we have observed that some libraries such as Hub (Team, 2022a) and Deep Lake (Hambardzumyan et al., 2022) achieve better performance over the network than the default PyTorch dataloader reading data locally for some scenarios. This is possible because the dataloader manages to pre-fetch the required data before the GPU needs it. We will offer a more extensive discussion in Section 5.
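
To make the pre-fetching idea concrete for myself: a minimal sketch, not Hub's or Deep Lake's actual loader. A background thread fills a bounded queue while the consumer (standing in for the GPU step) works on the previous item, so load latency overlaps with compute. The names `prefetch`, `slow_loader`, and `buffer_size` are my own, and the sleeps only simulate network reads and training steps.

```python
import queue
import threading
import time

_SENTINEL = object()  # marks the end of the stream


def prefetch(iterable, buffer_size=4):
    """Yield items from `iterable`, loading up to `buffer_size` items ahead
    in a background thread."""
    q = queue.Queue(maxsize=buffer_size)

    def producer():
        try:
            for item in iterable:
                q.put(item)  # blocks while the buffer is full
        finally:
            # Always signal the end, even if loading fails, so the
            # consumer does not wait forever.
            q.put(_SENTINEL)

    threading.Thread(target=producer, daemon=True).start()

    while True:
        item = q.get()
        if item is _SENTINEL:
            return
        yield item


def slow_loader(n, delay=0.01):
    """Hypothetical stand-in for reading samples over the network."""
    for i in range(n):
        time.sleep(delay)
        yield i


if __name__ == "__main__":
    for sample in prefetch(slow_loader(5), buffer_size=2):
        time.sleep(0.01)  # stand-in for a GPU training step
        print("consumed", sample)
```

With the buffer in place, the next sample is usually already waiting when the consumer asks for it, which is the effect the paper credits for the network-vs-local result.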
[Deep Lake paper]
Deep Lake: a Lakehouse for Deep Learning
https://arxiv.org/pdf/2209.10785.pdf
Cannot run on the Conti network due to an SSL error:
HTTPSConnectionPool(host='app.activeloop.ai', port=443): Max retries exceeded with url: /api/user/token (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: self signed certificate in certificate chain (_ssl.c:1129)')))
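
Possible workaround, untested on this network: "self signed certificate in certificate chain" usually means a TLS-inspecting proxy is re-signing traffic with a corporate root CA that Python does not trust. The `HTTPSConnectionPool` in the message suggests the client goes through urllib3/requests, so pointing `REQUESTS_CA_BUNDLE` (read by requests) and `SSL_CERT_FILE` (read by OpenSSL's default verify paths) at the corporate CA bundle may help. That is an assumption about the cause; the helper name `use_ca_bundle` and the example path are hypothetical.

```python
import os
from pathlib import Path


def use_ca_bundle(bundle_path):
    """Point requests and Python's ssl defaults at a custom CA bundle (PEM).

    Call this before the client library makes its first HTTPS request.
    """
    path = Path(bundle_path).expanduser()
    if not path.is_file():
        raise FileNotFoundError(f"CA bundle not found: {path}")

    # requests honors REQUESTS_CA_BUNDLE; OpenSSL's default verify
    # locations honor SSL_CERT_FILE.
    os.environ["REQUESTS_CA_BUNDLE"] = str(path)
    os.environ["SSL_CERT_FILE"] = str(path)
    return str(path)


# Example call (hypothetical path to the exported corporate root CA):
# use_ca_bundle("~/certs/corporate-root-ca.pem")
```

This keeps certificate verification on, unlike disabling verification outright, and only adds the extra trusted root.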