'Matt Bossenbroek' via Datomic
2015-10-13 15:18:09 UTC
We've been seeing the following error crop up semi-frequently recently. Restarting the peer fixes it, but that's obviously not a long term solution.
This is the error we see on the peer:
clojure.lang.ExceptionInfo: :db.error/transactor-unavailable Transactor not available {:db/error :db.error/transactor-unavailable}
at datomic.peer$transactor_unavailable.invoke(peer.clj:186)
at datomic.peer.Connection.transactAsync(peer.clj:349)
at datomic.peer.Connection.transact(peer.clj:332)
at datomic.api$transact.invoke(api.clj:90)
And this is the error we see on the transactor (ip address changed, but it's the peer):
2015-10-12 02:06:55.505 WARN default org.hornetq.core.client - HQ212037: Connection failure has been detected: HQ119014: Did not receive data from /10.0.0.0:59725. It is likely the client has exited or crashed without closing its connection, or the network between the server and client has failed. You also might have configured connection-ttl and client-failure-check-period incorrectly. Please check user manual for more information. The connection will now be closed. [code=CONNECTION_TIMEDOUT]
2015-10-12 02:06:55.505 WARN default org.hornetq.core.server - HQ222061: Client connection failed, clearing up resources for session 4520159d-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.505 WARN default org.hornetq.core.server - HQ222107: Cleared up resources for session 4520159d-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.506 WARN default org.hornetq.core.server - HQ222061: Client connection failed, clearing up resources for session 4524f79e-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.507 WARN default org.hornetq.core.server - HQ222107: Cleared up resources for session 4524f79e-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.507 WARN default org.hornetq.core.server - HQ222061: Client connection failed, clearing up resources for session 465decd0-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.507 WARN default org.hornetq.core.server - HQ222107: Cleared up resources for session 465decd0-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.508 WARN default org.hornetq.core.server - HQ222061: Client connection failed, clearing up resources for session 466084e1-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.508 WARN default org.hornetq.core.server - HQ222107: Cleared up resources for session 466084e1-7085-11e5-8b24-1fc2cdcb56a5
Some googling of the error turned up two possibilities: 1) the laptop sleep problem (not applicable here because these are both AWS instances), and 2) the transactor is under some pressure.
Looking into the second, here's a graph of AvailableMB over the past week, where you can see it oscillating between 15-20GB:
This looks fine to me & the only errors I see on the transactor are the above client warnings. There's literally nothing logged as ERROR.
Also weird is that a peer restart fixes the issue. If it truly were the transactor under pressure, I would expect that to not remedy the issue.
Any thoughts where to look next? This is with datomic pro 0.9.5130.
Thanks,
Matt
This is the error we see on the peer:
clojure.lang.ExceptionInfo: :db.error/transactor-unavailable Transactor not available {:db/error :db.error/transactor-unavailable}
at datomic.peer$transactor_unavailable.invoke(peer.clj:186)
at datomic.peer.Connection.transactAsync(peer.clj:349)
at datomic.peer.Connection.transact(peer.clj:332)
at datomic.api$transact.invoke(api.clj:90)
And this is the error we see on the transactor (ip address changed, but it's the peer):
2015-10-12 02:06:55.505 WARN default org.hornetq.core.client - HQ212037: Connection failure has been detected: HQ119014: Did not receive data from /10.0.0.0:59725. It is likely the client has exited or crashed without closing its connection, or the network between the server and client has failed. You also might have configured connection-ttl and client-failure-check-period incorrectly. Please check user manual for more information. The connection will now be closed. [code=CONNECTION_TIMEDOUT]
2015-10-12 02:06:55.505 WARN default org.hornetq.core.server - HQ222061: Client connection failed, clearing up resources for session 4520159d-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.505 WARN default org.hornetq.core.server - HQ222107: Cleared up resources for session 4520159d-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.506 WARN default org.hornetq.core.server - HQ222061: Client connection failed, clearing up resources for session 4524f79e-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.507 WARN default org.hornetq.core.server - HQ222107: Cleared up resources for session 4524f79e-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.507 WARN default org.hornetq.core.server - HQ222061: Client connection failed, clearing up resources for session 465decd0-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.507 WARN default org.hornetq.core.server - HQ222107: Cleared up resources for session 465decd0-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.508 WARN default org.hornetq.core.server - HQ222061: Client connection failed, clearing up resources for session 466084e1-7085-11e5-8b24-1fc2cdcb56a5
2015-10-12 02:06:55.508 WARN default org.hornetq.core.server - HQ222107: Cleared up resources for session 466084e1-7085-11e5-8b24-1fc2cdcb56a5
Some googling of the error turned up two possibilities: 1) the laptop sleep problem (not applicable here because these are both AWS instances), and 2) the transactor is under some pressure.
Looking into the second, here's a graph of AvailableMB over the past week, where you can see it oscillating between 15-20GB:
This looks fine to me & the only errors I see on the transactor are the above client warnings. There's literally nothing logged as ERROR.
Also weird is that a peer restart fixes the issue. If it truly were the transactor under pressure, I would expect that to not remedy the issue.
Any thoughts where to look next? This is with datomic pro 0.9.5130.
Thanks,
Matt
--
You received this message because you are subscribed to the Google Groups "Datomic" group.
To unsubscribe from this group and stop receiving emails from it, send an email to datomic+***@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
You received this message because you are subscribed to the Google Groups "Datomic" group.
To unsubscribe from this group and stop receiving emails from it, send an email to datomic+***@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.