Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[SPARK-5523] [CORE] [STREAMING] Add a cache for hostname in TaskMetri…
…cs to decrease the memory usage and GC overhead Hostname in TaskMetrics will be created through deserialization, mostly the number of hostname is only the order of number of cluster node, so adding a cache layer to dedup the object could reduce the memory usage and alleviate GC overhead, especially for long-running and fast job generation applications like Spark Streaming. Author: jerryshao <[email protected]> Author: Saisai Shao <[email protected]> Closes apache#5064 from jerryshao/SPARK-5523 and squashes the following commits: 3e2412a [jerryshao] Address the comments b092a81 [Saisai Shao] Add a pool to cache the hostname
- Loading branch information