-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Vector tiles are too large #34
Comments
Still having problems with exporting geometry packed enough. First order of business:
|
@strk Would be great if you could help me solve this problem on Friday! |
I'm not surprised that a vectorized version of a raster is bigger then the raster. In a raster every pixel can be a single byte (depending on the amount of information stored for every pixel) while the vectorized version has at least 93 bytes in WKB form (and 14 in TWKB form). So how much would a vectorization benefit in size highly depends on how many polygons are needed to represent how many pixels. Remember also that adjacent polygons have duplicated boundary which makes things even worst in terms of size. So, going with a raster representation is IMHO a good idea. The whole dataset is composed roughly by ~5,500 values (one value per day for 15 years) per pixel, At any given time the user would probably only be looking at a smaller resolution version of the raster and for a less detailed time-resolution too, so we probably don't want to transfer the whole data to the client at once, for rendering on the client. What is being served right now is not clear to me. I understand protobuffer-encoded mapnik vector tiles containing a vectorized version of the raster are being sent, but dunno yet the amount of data they contain (which resolution, how many timeslots). Will be setting up some debugging tools to figure that out and add more info here as I have them. |
Possible solutions (cont'd):As per discussion with @strk, there are several ways to do this, all with their own pro's and con's. This is an overview of those: Raster tiles animated with CSS transitionsSimply generating raster tiles for each slice. Need to serve them continuosly and transition with CSS. Problems: Frames-per-second: If we're doing 1fps, and the area is say 25 tiles, then we need to serve 25 tiles per second (and transition). Each tile is approx. 25kB, ie. Tiles will have to be fetched with websockets (at 625kB/s), due to number of requests. Video tilesGenerating a 256x256 video (per tile) of arbitrary time-range on demand, adding this to tile grid. Problems: Format of video must be investigated - Single videoGenerating a video with same pixel ratio as original raster, adding to map by georeferenced frame. Problems: The video could become quite big. 800x1400 pixels. What about streaming, instead of contained file? Vector tiles with geometry once and changed values only |
Goal
The goal here is to create animations with snow-rasters. We have GeoTIFF's of snow coverage, one raster per day, for fifteen years back. We want to be able to play animation of this coverage - preferably from any given period.
Vector tile size
Working on "snow-raster" datasets, vector tiles are up to several megabytes (which is kind of insane, given the original raster is a mere 1.6MB.) Should be optimized as much as possible, preferably down to a few kB per tile, although this might be but a dream...
It seems that the vectorized dataset is much larger than the original raster, when vectorized from pixels to polygons. A single vector tile can be as much as 10MB from this 1.6MB raster file.
Having second thoughts about the whole approach.
Possible solutions
1. Points instead of polygons
Vectorizing the raster into points instead of polygons should save us some space. However, the points would have to be represented as squares, and it seems messy.
2. GIF-tiles
Crazy idea, creating 256x256 GIF's on demand on server, and playing them as tiled images in browser. Problems: hard to control playback (if at all possible); stupid solution in general; no dynamic control of style, nor querying of data.
3. Playing raster tiles with transition
Perhaps the easiest solution, after all, but hard to say if it will work without having tried. Seems the size of raster tiles might be smaller than vector tiles (contrary to intuition), so at least an improvement in that regard. Also, no dependency on GL.
4. Heavy simplification of polygons.
Currently simplification of polygons are not really working, perhaps because all polygons are perfect squares, so not much to simplify without dropping polygon altogether. Another way to simplify polygons, would be to combine four of them into one large polygon (with average value). This will lose a lot of the resolution, however - but something's gotta give!
5. TopoJSON?
Instead of storing all four corners of polygons, perhaps store only one corner (similar to what TopoJSON does). This is feasible for vectorized rasters, and will cut size - in theory - by a factor of four. (Which is not enough, however.) It seems there's a lot to gain with simplification in general, though.
6. D3.js
Eureka: The polygons are always the same - only the value
val
changes in the timeseries! Thus, no need to have several representations of polygons, only need theval
andid
. This saves massive bytes.The geometry of the
datacube
(2D datasets over time) could be represented as a TopoJSON, created from vectorized raster - then all the values (val
) of each polygon (id
) for each slice (of the timeseries) could be queried at will and colored in by d3.js.Tests and debug tools
The text was updated successfully, but these errors were encountered: