Guile 3.0.7 test suite fails non-deterministically

  • Open
Details
5 participants
  • Michael Jury
  • Carl Dong
  • Leo Famulari
  • Ludovic Courtès
  • Maxime Devos
Owner
unassigned
Submitted by
Michael Jury
Severity
important
Merged with
Michael Jury wrote on 13 May 2021 01:51
Guix pull failed
(address . bug-guix@gnu.org)
CABrMUgfcbdWoXa5cGyJPq7Gya-PMZOf0BXC+TuOROEECM8xBcg@mail.gmail.com
guix pull: error: You found a bug: the program
'/gnu/store/0bgadpjvbbr1cs2pr9z4kqizx3k7smil-compute-guix-derivation'
failed to compute the derivation for Guix (version:
"6a724b0277f31245a0573c8347a6462786d7c0d1"; system: "x86_64-linux";
host version: "55aa50834fe07597adddbfc4e007de9420b195ec"; pull-version: 1).
Please report it by email to <bug-guix@gnu.org>.

Let me know if you want the backtrace or anything else.

--
Best
-Michael Jury
"The question isn't who will let me, the question is who will stop me."
Leo Famulari wrote on 13 May 2021 04:30
(no subject)
(address . control@debbugs.gnu.org)
YJyPU/82FDnmPFW8@jasmine.lan
merge 48392 48391 48390 48389
Michael Jury wrote on 13 May 2021 22:16
Never mind.
(address . 48389@debbugs.gnu.org)
CABrMUgfBAjR=G7SEA1YAwt96t9zwmp1GVtpPjH0wM=szfRonsg@mail.gmail.com
After a reboot it went away.

I had retried pulling for a while, assuming a network issue,
but hadn't tried a reboot.
Ludovic Courtès wrote on 13 May 2021 22:53
control message for bug #48389
(address . control@debbugs.gnu.org)
87tun6s6dn.fsf@gnu.org
merge 48389 48395
quit
Ludovic Courtès wrote on 13 May 2021 22:58
(address . control@debbugs.gnu.org)
875yzms64i.fsf@gnu.org
retitle 48389 Guile 3.0.7 test suite fails non-deterministically
quit
Ludovic Courtès wrote on 13 May 2021 22:58
(address . control@debbugs.gnu.org)
874kf6s64f.fsf@gnu.org
severity 48389 important
quit
Ludovic Courtès wrote on 13 May 2021 23:02
Re: bug#48389: Guix pull failed
(name . Leo Famulari)(address . leo@famulari.name)
87pmxuqrd9.fsf_-_@gnu.org
Hi,

Leo Famulari <leo@famulari.name> skribis:

> On Wed, May 12, 2021 at 09:34:15PM -0400, Leo Famulari wrote:
>> Guile 3.0.7 failed to build on the build farm, during the test suite.
>>
>> There's nothing to link to on ci.guix.gnu.org because Cuirass can't
>> evaluate Guix without Guile.
>>
>> I've attached the build log.
>
> I logged in to berlin.gnu.org and tried building the Guile derivation
> again. It failed once, then succeeded, then failed a few more times. So
> there is some nondeterminism in the test suite.

Ouch, apologies for that (I did test on all 4 platforms before pushing
and didn’t stumble upon this issue.)

The log you posted is mangled. Do you have a clearer one at hand?

> At least now there is a substitute for it.

Yeah.

Thanks,
Ludo’.
Leo Famulari wrote on 14 May 2021 03:26
(name . Ludovic Courtès)(address . ludo@gnu.org)
YJ3RvkLzMIqfmjHw@jasmine.lan
On Thu, May 13, 2021 at 11:02:42PM +0200, Ludovic Courtès wrote:
> The log you posted is mangled. Do you have a clearer one at hand?

I downloaded it from here, and it works:


But now I've attached it uncompressed; it's only 281 KiB.
Ludovic Courtès wrote on 14 May 2021 11:44
(name . Leo Famulari)(address . leo@famulari.name)
87wns1ps3x.fsf@gnu.org
Hi Leo,

Leo Famulari <leo@famulari.name> skribis:

> Running suspendable-ports.test
> UNRESOLVED: suspendable-ports.test: non-revealed port is closed
> ERROR: suspendable-ports.test: revealed port fdes not closed - arguments: ((system-error "seek" "~A" ("Bad file descriptor") (9)))

I can’t seem to reproduce it in the Guile build tree, either with:

while ./check-guile suspendable-ports.test ; do : ; done

or with:

while ./check-guile ; do : ; done

We’ll have to try to gather more info if it comes up again, or see if
there are configs that trigger it more easily.

Thanks,
Ludo’.
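
To gather more information on how often the failure shows up, the two loops above could be extended to count failures and keep the output of each failing run. This is only a minimal sketch, assuming a POSIX shell; `count_failures` and the log directory are hypothetical names, not part of Guile's build system.

```shell
# Hypothetical helper (not part of Guile): run a command a fixed number of
# times, keep the output of each failing run, and report how often it failed.
# Usage: count_failures RUNS LOGDIR COMMAND [ARGS...]
count_failures() {
    runs=$1; logdir=$2; shift 2
    mkdir -p "$logdir" || return 2
    failures=0
    i=1
    while [ "$i" -le "$runs" ]; do
        log="$logdir/run-$i.log"
        if "$@" >"$log" 2>&1; then
            rm -f "$log"                 # passing runs are not interesting
        else
            failures=$((failures + 1))   # keep the log of the failing run
        fi
        i=$((i + 1))
    done
    echo "$failures/$runs runs failed"
    [ "$failures" -eq 0 ]                # non-zero status if anything failed
}

# Example, from the top of a Guile build tree (the log directory is arbitrary):
#   count_failures 50 /tmp/guile-logs ./check-guile suspendable-ports.test
```

Unlike the `while ... ; do : ; done` form, this does not stop at the first failure, so it gives a rough failure rate and leaves one log per failing run to compare.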
Leo Famulari wrote on 7 Jun 2021 21:06
(name . Ludovic Courtès)(address . ludo@gnu.org)
YL5uS5FbpjPw7q1L@jasmine.lan
On Fri, May 14, 2021 at 11:44:18AM +0200, Ludovic Courtès wrote:
> We’ll have to try to gather more info if it comes up again, or see if
> there are configs that trigger it more easily.

It's happening again for the ungrafting evaluation:

https://ci.guix.gnu.org/eval/45377/log/raw (sorry, this log is unhelpful)

And Chris Baines sees it in the Guix Data Service too:


For now I'm doing something like `guix gc --clear-failures
/gnu/store/...-guile-3.0.7 && guix build /gnu/store/...-guile-3.0.7.drv`
in a loop on the Berlin server.
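
The loop itself is not shown above; one way it might look is the bounded retry below. This is a sketch under assumptions, not the script actually used on the server: `retry` and `rebuild_once` are hypothetical names, and `ITEM` and `DRV` are placeholders because the store paths are elided in this message.

```shell
# Hypothetical helper: run COMMAND up to MAX times, stopping at the first
# success. Progress goes to stderr so stdout stays clean.
# Usage: retry MAX COMMAND [ARGS...]
retry() {
    max=$1; shift
    attempt=1
    while [ "$attempt" -le "$max" ]; do
        if "$@"; then
            echo "succeeded on attempt $attempt" >&2
            return 0
        fi
        echo "attempt $attempt of $max failed" >&2
        attempt=$((attempt + 1))
    done
    return 1
}

# One attempt, mirroring the two commands quoted above: forget the cached
# failure, then build again. ITEM and DRV are placeholders; the real store
# paths are elided in the thread and are not filled in here.
rebuild_once() {
    guix gc --clear-failures "$ITEM" && guix build "$DRV"
}

# Example:
#   ITEM=/gnu/store/...-guile-3.0.7
#   DRV=/gnu/store/...-guile-3.0.7.drv
#   retry 20 rebuild_once
```

Bounding the number of attempts keeps the loop from spinning forever if the failure turns out to be deterministic on a given machine.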
Carl Dong wrote on 15 Jul 2021 23:17
Guile 3.0.7 test suite fails non-deterministically
(address . 48389@debbugs.gnu.org)
4AF272E1-1F51-4576-8CAF-EB8246CDDD36@carldong.me
Hi all,

I’m also encountering this exact error when time-machining from 0961807915d7955e70a081272d857b6cbf92c0c8 to 83d9e2ee56cb0ca2a644074ad232503e25ac7116.

I’m able to reproduce this reliably: I’ve tried building the derivation 5 times now with different --cores values, and the error happens every time.


My system:
- AMD Ryzen Threadripper 2970WX 24-Core Processor
- Guix on Arch Linux
- tmpfs mounted on /tmp
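
For reference, trying several `--cores` values can be scripted roughly as follows. This is only a sketch: `sweep_cores` and `build_with_cores` are hypothetical names, the core counts are arbitrary, and `DRV` is a placeholder because the derivation path is not given here.

```shell
# Hypothetical helper: run COMMAND once per core count, passing the count as
# the last argument, and print one result line per count.
# Usage: sweep_cores "1 2 4 8" COMMAND [ARGS...]
sweep_cores() {
    counts=$1; shift
    for n in $counts; do
        if "$@" "$n" >/dev/null 2>&1; then
            echo "cores=$n ok"
        else
            echo "cores=$n FAILED"
        fi
    done
}

# Wrapper for the Guix case. DRV is a placeholder: the thread does not give
# the derivation path.
build_with_cores() {
    guix build --cores="$1" "$DRV"
}

# Example:
#   DRV=/gnu/store/...-guile-3.0.7.drv
#   sweep_cores "1 4 24" build_with_cores
```

Once one build succeeds, the output exists in the store and later `guix build` calls return immediately, so the sweep is only informative while the builds keep failing.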

Cheers,
Carl Dong
contact@carldong.me
"I fight for the users"
Maxime Devos wrote on 19 Jul 2021 13:05
‘non-revealed port is closed’ test case is already fixed
0ca8049ef1812d2d4079d57c5c9fb9ea874abc4d.camel@telenet.be