
Always profit 0 when evaluating a model #7

Open
Kalelv45 opened this issue Aug 28, 2018 · 16 comments
@Kalelv45

It always stays at a profit of 0.00 when I evaluate a model.
Why might that be?

@kkuette

kkuette commented Sep 12, 2018

Your agent did not learn well; I think it's just overfitting.

@iorobot

iorobot commented Sep 15, 2018

Look at #1.
I also have that problem, but even with that solution my results using almost the same models (model_1000 or model_1010) are extremely volatile.

@kkuette

kkuette commented Sep 25, 2018

Try tweaking your settings.
If you want something with more functionality, you can look at this.

@satinder147

satinder147 commented Dec 31, 2018

@Kalelv45 @kkuette @iorobot @edwardhdlu I tried it myself and found that tweaking the code a little yields very good results. I changed the way the reward was provided: if there were more than 20 consecutive buys or 50 consecutive "no action" steps, I gave the agent a big negative reward such as -500. Apart from this, I removed the max (in the buy action) and multiplied the reward by 100. I trained the model for 220 episodes, and the 220th showed a profit of 25,000 dollars on the last three years of Google stock price data.
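(A minimal sketch of the reward shaping described in this comment; the function name and counter variables are assumptions for illustration, not code from the repository:)

```python
def shaped_reward(base_reward, consecutive_buys, consecutive_holds):
    """Penalize degenerate policies that only buy or only sit idle."""
    if consecutive_buys > 20 or consecutive_holds > 50:
        return -500  # large negative reward discourages repeating the action
    return base_reward * 100  # scale up the profit/loss signal
```

The caller would track the two counters over the episode and reset them whenever the agent takes a different action.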

@iorobot

iorobot commented Jan 9, 2019

Nice! Can you open a pull request with the code?

@chmbrs

chmbrs commented Jan 17, 2019

> @Kalelv45 @kkuette @iorobot @edwardhdlu I tried it myself and found that tweaking the code a little yields very good results. I changed the way the reward was provided: if there were more than 20 consecutive buys or 50 consecutive "no action" steps, I gave the agent a big negative reward such as -500. Apart from this, I removed the max (in the buy action) and multiplied the reward by 100. I trained the model for 220 episodes, and the 220th showed a profit of 25,000 dollars on the last three years of Google stock price data.

You mean that you removed the max function in the sell action, right?

@kkuette

kkuette commented Jan 18, 2019

Are you sure that your model isn't overfitted?

@satinder147

satinder147 commented Jan 18, 2019

I removed the max function because otherwise the model is not penalized for wrong decisions. You can then increase the size of the penalty (not every time) by multiplying the reward it gets after selling by 100.
As far as overfitting is concerned, I have tried it with a bunch of other stocks and it works really well.
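(To make the max-removal concrete: assuming the original sell reward is clamped with `max(profit, 0)` as the thread suggests, the tweak would look something like this sketch. The function names are illustrative:)

```python
def sell_reward_original(sell_price, bought_price):
    # Clamped reward: losing trades yield zero, so the agent
    # is never penalized for a bad sell.
    return max(sell_price - bought_price, 0)

def sell_reward_tweaked(sell_price, bought_price):
    # Unclamped and scaled: losses propagate as negative reward,
    # and the x100 factor amplifies the learning signal.
    return (sell_price - bought_price) * 100
```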

@gucciwang

What's the window size you're using? @satinder147

@satinder147

@calvin-is-seksy 10
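(For context, the window size sets how many recent price movements make up the agent's state. A minimal sketch of a q-trader-style state builder, assuming the common sigmoid-of-price-differences layout; the function name and start-padding behaviour here are illustrative:)

```python
import math

def get_state(prices, t, window):
    # Illustrative state: sigmoid of each consecutive price delta over
    # the last `window` prices, padding with the first price when the
    # window extends before the start of the series. `prices` is a list.
    d = t - window + 1
    block = prices[d:t + 1] if d >= 0 else [prices[0]] * -d + prices[:t + 1]
    return [1.0 / (1.0 + math.exp(-(block[i + 1] - block[i])))
            for i in range(window - 1)]
```

With a window of 10, each state is a vector of 9 features.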

@satinder147

satinder147 commented Feb 19, 2019

I just made a pull request. You can all have a look.

@iorobot

iorobot commented Mar 4, 2019

Even with your code I still get 0 profit with the GSPC dataset. Any ideas why?

@gucciwang

I used a window size of 20, and found that my model converges and profits the most on my 30th episode. All models after episode 100 seem to diverge and just never buy anything. Any ideas on how to tweak the model so it converges further?

[image attachment]

@iorobot

iorobot commented Mar 5, 2019

Well, it seems to me that it is only a coincidence. Have you tried using different data? Do you find the same behaviour?

@GarfieldHuang

GarfieldHuang commented Jul 29, 2019

> I used a window size of 20, and found that my model converges and profits the most on my 30th episode. All models after episode 100 seem to diverge and just never buy anything. Any ideas on how to tweak the model so it converges further?
>
> [image attachment]

Hello, I want to know how to plot this graph.
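(For reference, a minimal sketch of how a profit-per-episode curve like the one above could be produced with matplotlib; the `profits` values here are placeholders, one total profit per training episode:)

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen, no display required
import matplotlib.pyplot as plt

# Placeholder data: total profit recorded at the end of each episode.
profits = [120.5, -30.0, 250.75, 90.0]

plt.plot(range(1, len(profits) + 1), profits, marker="o")
plt.xlabel("Episode")
plt.ylabel("Total profit ($)")
plt.title("Profit per training episode")
plt.savefig("profit_per_episode.png")
```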

@rui-ren

rui-ren commented Oct 4, 2019

> I used a window size of 20, and found that my model converges and profits the most on my 30th episode. All models after episode 100 seem to diverge and just never buy anything. Any ideas on how to tweak the model so it converges further?
>
> [image attachment]

How about your running time? It is super slow on my laptop; I have an RTX 2060, but 10 episodes still took 1 hour. Thanks.
