Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

uart: BW improvements #4620

Merged
merged 37 commits into from
Dec 10, 2018
Merged

uart: BW improvements #4620

merged 37 commits into from
Dec 10, 2018

Conversation

d-a-v
Copy link
Collaborator

@d-a-v d-a-v commented Apr 8, 2018

read_char straightly use hw buffer (+ ~10%bw)
read by block (+ ~190%bw) (instead of generic Stream::readBytes)
attributes for functions called by ISR
remove overrun message
remove some ISR flags which were not honored

d-a-v added 4 commits April 6, 2018 15:59
read_char straightly use hw buffer (+ ~10%bw)
read by block (+ ~190%bw) (instead of generic Stream::readBytes)
attributes for functions called by ISR
remove overrun message
remove some ISR flags which were not honoured
@d-a-v
Copy link
Collaborator Author

d-a-v commented Apr 8, 2018

@mribble @jasoroony Since you are dealing with low latency or high bandwidth serial setup,
can you check if this PR does not break things ?

@ghost
Copy link

ghost commented Apr 8, 2018

My FTP application doesn't seem to have any problem with this change. I also used the new readBytes method when sending file-chunks, works well, but it didn't improve the performance. With WiFi, SD-Card and Serial all in the mix I didn't really expect it to.

@@ -268,9 +314,16 @@ uart_start_isr(uart_t* uart)
// triggers the IRS very often. A value of 127 would not leave much time
// for ISR to clear fifo before the next byte is dropped. So pick a value
// in the middle.
USC1(uart->uart_nr) = (100 << UCFFT) | (0x02 << UCTOT) | (1 <<UCTOE );
// update: with direct peeking and loopback test @ 3Mbauds/8n1 (=2343Kibits/s):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't understand the code changes below, or this explanation. Considering that you seem to have investigated this in detail, could you please explain the new code, and the meaning of the registers? (future reference)

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These are the UITO's brothers above. I will reenable them but treat the interrupt differently.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

error interrupts are now managed, Serial.hasRxError() tells if receive error occured.

@@ -252,7 +298,7 @@ uart_isr(void * arg)
ETS_UART_INTR_DISABLE();
return;
}
if(USIS(uart->uart_nr) & ((1 << UIFF) | (1 << UITO)))
if(USIS(uart->uart_nr) & (1 << UIFF))
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't understand this code change. Could you please add comments to explain this new condition?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I dont understand why this flag (UITO = RX FIFO timeout) triggers buffer copy, it has nothing to do. It is the role of UIFF alone (hw-rx-fifo full of 100 chars trigger).
What this interrupt could trigger is a flag in our future global sanity register stating that there are decoding error on the uart line.

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Doesn't UITO trigger when the RX FIFO is not empty, but also not "full" and some time has passed. That would be used in simpler libraries that only allowed access to the software buffer and needed the ISR to copy them over.

cores/esp8266/uart.c Show resolved Hide resolved
cores/esp8266/uart.c Show resolved Hide resolved
cores/esp8266/uart.c Show resolved Hide resolved
cores/esp8266/uart.c Show resolved Hide resolved
@devyte
Copy link
Collaborator

devyte commented Apr 10, 2018

@d-a-v Nice work with this, the code changes make a lot of sense. My comments are mostly some minor refactoring and inline comments to explain what is going on for future reference.

@ghost
Copy link

ghost commented Apr 10, 2018

I've been inspired by all this, somewhat.. I had already written a C program to calculate the actual baud rates generated by the two host MCUs I'm testing with (along with the third ESP8266 they talk to)...

Comparing the bauds that came out equal...

One doesn't like baud rates higher than 1.5MHz, but the other goes up to 3MHz (not mentioning names, because the "better" one in this case has massive jumps at this high end which in my mind makes them more equal, except in peculiar circumstances.. and ones that are hard to work out).

I am confident I'm running into problems where reading a single character at a time (my 2 main test sketches that check each character is +1 from the last) are taking too long to deal with @ 3MHz. FTP also doesn't work, even with the readBytes method that improves performance... But don't fret, I think it was sending the file (I.E. ESP8266 reads the WiFi and sends Serial data - so it's the host MCU that I think has the problem there).

But.. A simulated DoS attack with the test sketches at this high speed does make the ESP8266 lose track of things (even with a 4KB software buffer). So, I would agree, anything that makes a tight loop reading a single character faster might be useful (regarding the inline comment above for reading a single character).

Speed wise, 2.3MHz -> 3MHz Serial makes the FTP daemon go from 145KB/s to 180KB/s. Just FYI.

@mribble
Copy link
Contributor

mribble commented Apr 10, 2018

My test isn't really high bandwidth, but it does bursts of a few hundred bytes at a time. I checked with this change and it continues to work well.

Copy link
Collaborator

@devyte devyte left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some of the changes to the register settings aren't clear, and strictly speaking should be explained in comments, but given the stress testing done, this needs to be merged asap.

libraries/esp8266/examples/serialStress/serialStress.ino Outdated Show resolved Hide resolved
@d-a-v d-a-v changed the title WIP: uart: BW improvements uart: BW improvements Dec 7, 2018
@d-a-v
Copy link
Collaborator Author

d-a-v commented Dec 7, 2018

remove WIP from title

devyte and others added 4 commits December 9, 2018 23:21
…ses local tests, not real CI)

script-comment new recipe.hooks.core.prebuild.3 (along with already commented .1 and .2)
moved CI package test to be first on the test list
remove 'set -x', wish me luck
@devyte devyte merged commit 4c8d8f1 into esp8266:master Dec 10, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants