Spike: Slow endpoints in Arc #926

chandadharap · 2015-01-22T18:53:22Z

No description provided.

seanbrookes · 2015-02-17T06:31:30Z

Had an initial hangout with Sam today to help frame up the spike.
Some notes:

the implementation is more sophisticated than the existing StrongOps one
ideally Chrome devtools will be a suitable 'view engine' for the feature.
We have a native JSON data prototype
Open to transforming it to a more consubable format for devtools
Need to deep dive into devtools timline code to figure out best way to map the api data to it.

seanbrookes · 2015-02-17T06:32:11Z

From the iojs tracing discussion: nodejs/node#671 (comment)

seanbrookes · 2015-02-17T06:33:45Z

Chrome tracing example git hub repo: https://github.com/thlorenz/traceviewify

seanbrookes · 2015-02-17T06:35:45Z

kick-off email notes:
On Sun, Feb 15, 2015 at 10:49 PM, Chanda Dharap [email protected] wrote:

Sean is concerned that Timeline view may not work. I have a spike for him so
he can evaluate implementation strategies and whether we use Timeline View
or not.

Ok. As a heads up, I asked Anthony, who I thought integrated the last
two dev-tools based displays (cpu and heap profiling), and he said it
was Miroslav had imported the entire chrome dev tools UI into arc...
its just that arc was disabling the display of those features that we
didn't need at the moment, like Timeline view.

Can you pass him the json that represents Slow end points?

An example of the agent-internal data format is here:
https://github.com/strongloop/strongops/pull/245#issue-53570095, along
with some fairly detailed docs on the meaning of the data.

Note that since our target is the TimeLine, after having realized that
the timeline data format is actually documented
(https://docs.google.com/a/strongloop.com/document/d/1CvAClvFfyA5R-PhYUmn5OOQtYMH4h6I0nSsKchNAySU/edit#heading=h.xqopa5m0e28f)
I think it would make more sense for agent or supervisor to transmit
and store the data in .traceview format.

I think we should do a spike on transforming the data into the trace
format described above. I might take a peek at it today.

And yes... I know I said before we should just transmit the existing
data format... but that was before I knew there was a standard format,
and not just a bunch of internal-to-chrome data structures!

I also think we should implement trace-start/trace-stop (similar to
cpu-start/cpu-stop), possibly even with watchdog mode.

Sean, have you looked at Anthony's cpu-profile display tab, and how
its using the Chrome dev tools?

I'm not sure what your concern is (other than the data format agent
tosses you), timelines looks pretty similar to the other dev tools we
rehosted into arc.

seanbrookes · 2015-02-17T06:38:57Z

Trace event format doc:
https://docs.google.com/a/strongloop.com/document/d/1CvAClvFfyA5R-PhYUmn5OOQtYMH4h6I0nSsKchNAySU/edit#heading=h.xqopa5m0e28f

seanbrookes · 2015-02-17T06:44:51Z

@sam-github

sam-github · 2015-02-17T17:16:25Z

chrome's approach to "tracing":

tracing working group nodejs/node#671 (comment)

thorsten's tools, around converting various formats into the "trace event" format:

@chandadharap As I understand @seanbrookes 's concerns now, its that the Chrome Dev Tools in general are incredibly complex, giving the appearance of power, but possibly just offering complexity: both complexity in terms of integrating them, but worse, complexity in terms of user experience.

My concern is that it appears google is playing hard at creating tools that can incorporate a wide range of time-based data, from stack traces, to what we call "metrics", to what we call "slow-endpoints" (they would call them traces). We have the opportunity to make a play here that will get us a tool that will allow agent to drive _many_ kinds of data into a single unified UI (rather than creating a UI for each thing we measure :-( ... ouch). It appears io.js/v8 may also be making a play towards exposing more runtime info as trace view format, so we shouldn't choose an incompatible direction.

seanbrookes · 2015-02-17T17:20:52Z

the more I read the more I'm coming around

sam-github · 2015-02-17T17:53:35Z

OK. A major open question is still what is the similarity and differences between the chrome://tracing view, and the "Timeline" view in Dev Tools... do they share data input formats? Is the chrome://tracing and internal tool, destined for Dev Tools once cleaned up? We appear to have a number of choices:

do our own view, tailor made to our data
use chrome://tracing
use Timeline in devtools
use concurix's (maybe an option)

I don't know if its 2 vs 3, or if they are based on same code, or... what.

@seanbrookes do you think you can figure that out? @bajtos, I wonder if you know?

sam-github · 2015-02-17T18:01:34Z

@chandadharap @seanbrookes one of the issues with evaling the google displays, is they only show internal chrome FE data (by default), I'd like to do a spike (or have @seanbrookes do this) where the slow endpoint data is converted to trace view format (there are a number of options), and we load it into the view, and see what our data would look like (rather than looking at what Chrome's data looks like).

Thoughts? @seanbrookes is this something you can take on?

bajtos · 2015-02-17T18:14:21Z

@sam-github I am not familiar with tracing/timeline views, can't help here :(

sam-github · 2015-02-17T18:15:36Z

Looks like timeline is more related to .cpuprofile, and tracing to the trace data structures... but that one can be converted to the other... hm. so they are seperate choices. :-(

altsang · 2015-02-17T18:53:00Z

@sam-github @seanbrookes
the point of seeing if chrome dev tools can be used first is to save time and go to market quicker. Quicker - but not at the expense of sacrificing the UX and utility.
Also if JS developers were already familiar with using CDT to their front end work, that this would be very much the same. T
The above options for converting to a trace view look fine to me. What's nice about that is that we can potentially leverage for what we need to do around tracing. Note - I specifically brought up with Issac that we won't have a "trace" like graphical experience if we move to timeline within CDT (i.e. flamegraph) and that at best two different "paths" would be indented to show the parent path but that it may not be clear that they are completely unrelated to each other. He said this was fine .
If we think we can leverage one of the existing npm modules shown above - I'm all for it, especially if we can get to a flamegraph experience, but it sounds like there's more work to be had to transpose the data to a trace view?

seanbrookes · 2015-02-17T20:00:12Z

my first priority is to understand the data formats required by cdt and chrome://tracing

I poked around the concurix repo's last night and was encouraged to see d3(svg) code for their flame graphs

sam-github · 2015-02-18T00:40:28Z

nodejs/node#671 (comment) <---- @seanbrookes very useful context info on chrome://tracing vs. Timeline view

chandadharap · 2015-02-23T05:08:19Z

Extremely useful spike. Basically if Concurix integration goes through, @ijroth agrees that Traces will override Slow end-points.

It is still valuable enough a direction that we should backlog a Spike for understanding Traceview format and if it would work for us. It appears io.js/v8 may also be making a play towards exposing more runtime info as trace view format. A Spike on the backlog would be helpful for the medium/long-term.

Created under scrum #191. Points back to detail on traceview here.

seanbrookes added the #sprint63 label Jan 22, 2015

seanbrookes self-assigned this Jan 22, 2015

chandadharap added #tob and removed #sprint63 labels Jan 27, 2015

chandadharap added #plan and removed #tob labels Feb 5, 2015

seanbrookes added #wip and removed #sprint64 labels Feb 17, 2015

seanbrookes assigned chandadharap and unassigned seanbrookes Feb 19, 2015

chandadharap added #verify and removed #wip labels Feb 23, 2015

chandadharap closed this as completed Feb 23, 2015

chandadharap removed the #verify label Feb 23, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spike: Slow endpoints in Arc #926

Spike: Slow endpoints in Arc #926

chandadharap commented Jan 22, 2015

seanbrookes commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

sam-github commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

sam-github commented Feb 17, 2015

sam-github commented Feb 17, 2015

bajtos commented Feb 17, 2015

sam-github commented Feb 17, 2015

altsang commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

sam-github commented Feb 18, 2015

chandadharap commented Feb 23, 2015

Spike: Slow endpoints in Arc #926

Spike: Slow endpoints in Arc #926

Comments

chandadharap commented Jan 22, 2015

seanbrookes commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

sam-github commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

sam-github commented Feb 17, 2015

sam-github commented Feb 17, 2015

bajtos commented Feb 17, 2015

sam-github commented Feb 17, 2015

altsang commented Feb 17, 2015

seanbrookes commented Feb 17, 2015

sam-github commented Feb 18, 2015

chandadharap commented Feb 23, 2015