Use SQLite to store torrent checkpoints data #5669

ichorid · 2020-10-22T10:13:41Z

Currently, we store torrent data and stats in separate files in the dlcheckpoints dir. Libtorrent never touches any of these. Instead, we mediate its access through Python code.

Moving to PonyORM-backed SQLite storage for torrents will save us a lot of hassle with file access synchronization and persistence, the kind of thing affecting e.g. #5615. It will also enable simpler and tighter integration with the Channels database, resulting in faster UI response.

What do you think, guys?


ichorid · 2020-10-22T10:15:49Z

@egbertbouman, we need your opinion on this, as the person who has worked most closely with the wrapper/checkpoint files.

egbertbouman · 2020-10-22T12:10:19Z

I don't really see how the issue you referenced is tied to the fact that we're storing the metadata/stats in files. Since we have to write the metadata/stats ourselves in both situations, the incorrect data will just be in the database instead. I also don't get the benefit in terms of hassle.

If there is a performance improvement from using the database, I think it's worth investigating. Just for my understanding: what information that we're currently getting from files would be faster to get from the database?

ichorid · 2020-10-22T12:38:30Z

AFAIK, in our current design, we store the upload/download counters in memory. We dump the data to disk only when closing Tribler, right? When opening Tribler, we load all the counters back into memory. This means the ratio counters can be lost if Tribler is shut down incorrectly.

By using SQLite we can solve persistence/shutdown problems with dlcheckpoints, while simultaneously removing a lot of utility code for e.g. managing our own file format for config files. Also, this will enable us to cache torrent bencoded dicts in the DB, as we did before in times of lmdb.
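As a rough illustration of what this could look like, here is a minimal sketch using the standard-library `sqlite3` module instead of PonyORM, to keep it self-contained. The table name, column names, and helper functions are all hypothetical and not taken from Tribler. Each write happens inside a transaction, so a counter update is either fully stored or not stored at all.

```python
import sqlite3

# Hypothetical schema: one row per torrent, keyed by the hex infohash.
SCHEMA = """
CREATE TABLE IF NOT EXISTS checkpoints (
    infohash    TEXT PRIMARY KEY,
    resume_data BLOB NOT NULL,
    uploaded    INTEGER NOT NULL DEFAULT 0,
    downloaded  INTEGER NOT NULL DEFAULT 0
)
"""


def open_checkpoint_db(path=":memory:"):
    """Open (or create) the checkpoint database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    conn.commit()
    return conn


def save_checkpoint(conn, infohash, resume_data, uploaded, downloaded):
    """Insert or overwrite the checkpoint row for one torrent."""
    # "with conn" commits on success and rolls back if an exception is raised.
    with conn:
        conn.execute(
            "INSERT OR REPLACE INTO checkpoints "
            "(infohash, resume_data, uploaded, downloaded) VALUES (?, ?, ?, ?)",
            (infohash, resume_data, uploaded, downloaded),
        )


def load_checkpoints(conn):
    """Return {infohash: (resume_data, uploaded, downloaded)} for all torrents."""
    rows = conn.execute(
        "SELECT infohash, resume_data, uploaded, downloaded FROM checkpoints"
    )
    return {row[0]: row[1:] for row in rows}
```

With something like this, `save_checkpoint` could be called periodically rather than only at shutdown, and `load_checkpoints` would replace reading the individual files at startup.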

xoriole · 2020-10-22T12:45:34Z

I like the idea of persisting stats, but if something like #5252 happens, the user will lose their list of downloads.

egbertbouman · 2020-10-22T12:48:27Z

The stats are collected by libtorrent itself (it's part of the resumedata). When checkpointing, this data is stored in the checkpoint file, along with the metadata.

ichorid · 2020-10-22T12:51:55Z

> The stats are collected by libtorrent itself (it's part of the resumedata). When checkpointing, this data is stored in the checkpoint file, along with the metadata.

You mean, the stats are stored in binary form and read directly by Libtorrent, right?

egbertbouman · 2020-10-22T13:01:29Z

Correct, when we resume a download after a restart, we give the metadata/resumedata back to libtorrent:

tribler/src/tribler-core/tribler_core/modules/libtorrent/download.py

Lines 160 to 168 in 8ad6a59

    
    metainfo = self.tdef.get_metainfo()
    torrentinfo = lt.torrent_info(metainfo)
    atp["ti"] = torrentinfo
    if resume_data and isinstance(resume_data, dict):
        # Rewrite save_path as a global path, if it is given as a relative path
        if b"save_path" in resume_data and not path_util.isabs(ensure_unicode(resume_data[b"save_path"], 'utf8')):
            resume_data[b"save_path"] = self.state_dir / ensure_unicode(resume_data[b"save_path"], 'utf8')
        atp["resume_data"] = lt.bencode(resume_data)
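The relative-path rewrite in the excerpt above can be tried in isolation, without libtorrent. The sketch below is only an approximation: `absolutize_save_path` is a hypothetical helper, `pathlib` stands in for Tribler's `path_util`, and a plain `decode` stands in for `ensure_unicode`.

```python
from pathlib import Path


def absolutize_save_path(resume_data, state_dir):
    """Return a copy of resume_data whose b"save_path" is absolute.

    A relative save_path is resolved against state_dir; an absolute one
    (or a missing key) is left untouched.
    """
    fixed = dict(resume_data)
    raw = fixed.get(b"save_path")
    if raw is not None:
        # Stand-in for ensure_unicode(): accept bytes or str.
        text = raw.decode("utf8") if isinstance(raw, bytes) else raw
        if not Path(text).is_absolute():
            fixed[b"save_path"] = Path(state_dir) / text
    return fixed
```

For example, `absolutize_save_path({b"save_path": b"downloads"}, state_dir)` yields a `save_path` under `state_dir`, while an already absolute path passes through unchanged.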

synctext · 2020-10-23T07:19:52Z

Not an expert on the performance or design balance discussed above, but...
It is vital that Tribler keeps on going; it should never crash or refuse to work. YouTube, TikTok or Spotify don't need a "repair database" button. Please enable safety checks. Losing a few bytes in the latest counters is not important compared to guarding database integrity.
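One possible shape for such a safety check, as a minimal sketch with the standard-library `sqlite3` module: run SQLite's `PRAGMA quick_check` when opening the database, and treat any failure as "start fresh" instead of crashing. The function name `open_checked` and the fallback policy are assumptions for illustration, not Tribler's actual behaviour.

```python
import sqlite3


def open_checked(path):
    """Open the database at path; return None if it looks corrupted.

    The caller can then fall back to a fresh database (or to rebuilding
    the data from another source) instead of refusing to start.
    """
    conn = sqlite3.connect(path)
    try:
        # quick_check returns a single row containing "ok" for a healthy database.
        row = conn.execute("PRAGMA quick_check").fetchone()
        healthy = row is not None and row[0] == "ok"
    except sqlite3.DatabaseError:
        # Raised, for example, when the file is not a SQLite database at all.
        healthy = False
    if not healthy:
        conn.close()
        return None
    return conn
```

The trade-off is the one described above: a failed check costs the stored counters, but the application keeps running.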

Example of world-class reliability engineering by BBC

ichorid · 2020-10-23T10:25:08Z

Firefox uses SQLite internally. I guess if it is good for them, it is good for us too...

ichorid · 2020-10-25T14:01:37Z

@synctext :

Simplicity

qstokkink · 2024-08-12T08:38:02Z

We'll follow the advice of @egbertbouman and not do this.

ichorid added type: enhancement performance labels Oct 22, 2020

ichorid added this to the V7.6: Stability, usability, performance milestone Oct 22, 2020

ichorid added the needs discussion label Oct 22, 2020

ichorid changed the title ~~Use SQLite to store torrent data~~ Use SQLite to store torrent checkpoints data Oct 22, 2020

drew2a modified the milestones: 7.6.0 November: Stability, usability, performance, Next-next release Nov 4, 2020

drew2a added component: GUI component: UX and removed component: GUI component: UX labels Jan 15, 2021

drew2a modified the milestones: Next-next release, Backlog Sep 15, 2021

qstokkink mentioned this issue Jan 12, 2024

We need to go through all the TODO/FIXME entries and clean them up/do something about them #1722

Closed

qstokkink closed this as completed Aug 12, 2024
