Here's another idea.
Instead of another H4-ECOM100, purchase a Do-more BX-DM1E-M PLC and have it do all the comms to the 16 slaves. That's all it does - it collects the data.
Stick the results in DLV memory in the BRX (it emulates DirectLOGIC PLC octal V memory) and then have the existing H4-ECOM100 do normal DirectLogic requests (yes, octal reads like V1777) with just ONE NETRX and ONE NETWX to the BRX. Do-more PLCs have MRX/MWX instructions for Modbus/TCP. I bet this might speed it up A TON!!
*** EDIT *** You may need to synchronize the reads and writes, so it may be better for the BRX to read/write from/to the slaves and write/read to the 405 PLC also, but the 405 comm can be one with TWO comm (one big DLRX and one big DLWX).
These could be done in Stages in the BRX PLC
SG S1 // Read 16 slaves data
// 16 MRX boxes in one rung
// last one uses Stage On Success, JMP S2
SG S2 // Write all that slave data to 454 in one contiguous block using native DirectLOGIC comm - needs to be one contiguous V memory block in the 454 to reduce comm to just ONE request
DLWX On Success JMP to S3
SG S3 // Read 454 data to be written to the 16 slaves - needs to be one contiguous V memory block in the 454 to reduce comm to just ONE request
DLRX OnSuccess JMP to S4
SG S4 // write that data to the 16 slaves
// 16 MWX boxes in one rung
// last MWX uses Stage On Success to JMP S1
Stage programming at its best! 