There are no changes in DNS configuration of these domains (which we are aware of) and there are no reachability issues outside bitrise.io, however such usages were not as frequent as from bitrise builds.
We will check this issue if it’s source is on our side. These VMs are hosted on Google Compute Engine with the standard DNS configuration, so there is a very small probability that this issue comes from us. Thank you for the report
OK, I’ve also set up an external DNS reachability monitor at port-monitor.com. It performs checking every minute, so if issue occurs again we’ll know whether it was only on bitrise.io or globally.
Issue appeared again on 2017-10-27T18:28:50Z (+20/-0 s).
Here is the build log: https://www.bitrise.io/build/19b44839b2e52522
That domain has not been monitored however, there is 2nd one monitored and handled by the same DNS servers, pointing to the different IP and that one has 100% uptime.
Additionally, there was another issue about 2017-10-27T18:33:00Z (+60/-0 min).
In this build: https://www.bitrise.io/build/924f50f7c14e28ab
That build was aborted due to timeout (75 min) but, it usually takes 20-25 min.
All that indicates that something went wrong on bitrise.io side.
Is it always the same URL which fails? Asking because we did not see any DNS resolution issues, in fact we had no DNS resolution issue in any of our continuous “control” builds.
If it’s always the same URL which fails then it’s more likely that the issue is either on that service’s side, or somewhere in-between (e.g. on domain / DNS server level), but not on bitrise.io / on the bitrise.io build workers.
Hard to say. Keep in mind that these build VMs are always clean, in every build, meaning if there’s a blip/temp DNS issue that might not affect your monitoring service as it already resolved the IP, but an environment which never resolved the IP might be affected. AFAIK this might happen for example if the DNS - IP refresh frequency is too low. But again quite hard to say as we can’t reproduce this issue
Can you add a retry in the related Script for the call? We don’t see any issue with any of our tests and no reports either. Would help a lot if you could add a retry there and let us know if that helps or not.