[CI] Flaky unit test for SHAP #6074

Closed
@hcho3

https://xgboost-ci.net/blue/organizations/jenkins/xgboost-win64/detail/PR-6019/6/pipeline

[2020-08-31T06:09:47.714Z] C:\tools\miniconda3\envs\win64_6ad3a277178e49eab07729d2452d1373\lib\site-packages\hypothesis\core.py:642: 
[2020-08-31T06:09:47.714Z] _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
[2020-08-31T06:09:47.714Z] 
[2020-08-31T06:09:47.714Z] self = <hypothesis.core.StateForActualGivenExecution object at 0x000002A00CD50F08>
[2020-08-31T06:09:47.714Z] message = "Hypothesis test_shap(self=<test_gpu_prediction.TestGPUPredict testMethod=test_shap>, num_rounds=141, dataset=sparse, ...el_tree': 10}, all_rows=True) produces unreliable results: Falsified on the first call but did not on a subsequent one"
[2020-08-31T06:09:47.714Z] 
[2020-08-31T06:09:47.714Z]     def __flaky(self, message):
[2020-08-31T06:09:47.714Z]         if len(self.falsifying_examples) <= 1:
[2020-08-31T06:09:47.714Z] >           raise Flaky(message)
[2020-08-31T06:09:47.714Z] E           hypothesis.errors.Flaky: Hypothesis test_shap(self=<test_gpu_prediction.TestGPUPredict testMethod=test_shap>, num_rounds=141, dataset=sparse, param={'max_depth': 10, 'max_leaves': 106, 'num_parallel_tree': 10}, all_rows=True) produces unreliable results: Falsified on the first call but did not on a subsequent one
[2020-08-31T06:09:47.714Z] 
[2020-08-31T06:09:47.714Z] C:\tools\miniconda3\envs\win64_6ad3a277178e49eab07729d2452d1373\lib\site-packages\hypothesis\core.py:847: Flaky
[2020-08-31T06:09:47.714Z] --------------------------------- Hypothesis ----------------------------------
[2020-08-31T06:09:47.714Z] Falsifying example: test_shap(
[2020-08-31T06:09:47.714Z]     self=<test_gpu_prediction.TestGPUPredict testMethod=test_shap>,
[2020-08-31T06:09:47.714Z]     num_rounds=141,
[2020-08-31T06:09:47.714Z]     dataset=sparse,
[2020-08-31T06:09:47.714Z]     param={'max_depth': 10, 'max_leaves': 106, 'num_parallel_tree': 10},
[2020-08-31T06:09:47.714Z]     all_rows=True,
[2020-08-31T06:09:47.714Z] )
[2020-08-31T06:09:47.714Z] Failed to reproduce exception. Expected: 
[2020-08-31T06:09:47.714Z] Traceback (most recent call last):
[2020-08-31T06:09:47.714Z]   File "C:\Users\Administrator\workspace\xgboost-win64_PR-6019\tests\python-gpu\test_gpu_prediction.py", line 219, in test_shap
[2020-08-31T06:09:47.714Z]     assert np.allclose(np.sum(shap, axis=len(shap.shape) - 1), margin, 1e-3, 1e-3)
[2020-08-31T06:09:47.714Z] AssertionError: assert False
[2020-08-31T06:09:47.714Z]  +  where False = <function allclose at 0x000002A07A04A948>(array([ 68.19425 , 233.31032 , 188.2829  , ...,  68.03375 ,  11.593885,\n        51.257465], dtype=float32), array([ 68.19389 , 233.31035 , 188.28214 , ...,  68.03336 ,  11.594096,\n        51.25665 ], dtype=float32), 0.001, 0.001)
[2020-08-31T06:09:47.714Z]  +    where <function allclose at 0x000002A07A04A948> = np.allclose
[2020-08-31T06:09:47.714Z]  +    and   array([ 68.19425 , 233.31032 , 188.2829  , ...,  68.03375 ,  11.593885,\n        51.257465], dtype=float32) = <function sum at 0x000002A07A031438>(array([[-1.3257432e-01, -4.1069903e+00,  1.6655694e-01, ...,\n         7.6787562e+00,  6.7797709e+00,  3.4963994e+00],\n...6780e-01,  3.2375448e+00, -4.1314743e-02, ...,\n         2.7299562e-01,  5.3053784e-01,  3.4963984e+00]], dtype=float32), axis=(2 - 1))
[2020-08-31T06:09:47.714Z]  +      where <function sum at 0x000002A07A031438> = np.sum
[2020-08-31T06:09:47.714Z]  +      and   2 = len((2000, 101))
[2020-08-31T06:09:47.714Z]  +        where (2000, 101) = array([[-1.3257432e-01, -4.1069903e+00,  1.6655694e-01, ...,\n         7.6787562e+00,  6.7797709e+00,  3.4963994e+00],\n...6780e-01,  3.2375448e+00, -4.1314743e-02, ...,\n         2.7299562e-01,  5.3053784e-01,  3.4963984e+00]], dtype=float32).shape
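
A note for anyone triaging this: the failing assertion is `np.allclose(shap_sum, margin, 1e-3, 1e-3)`, which checks `|shap_sum - margin| <= atol + rtol * |margin|` element by element. The six values that the log prints all differ by less than 1e-3, so the offending rows are somewhere in the elided part of the 2000-element arrays. Hypothesis raises `Flaky` because the same example failed once and then passed on replay, so the result is not deterministic between runs. The sketch below is one possible way to list the rows that break the tolerance. The helper name `report_mismatches` is hypothetical and is not part of the XGBoost test suite, and the arrays contain only the values visible in the log above.

```python
import numpy as np


def report_mismatches(shap_sum, margin, rtol=1e-3, atol=1e-3):
    """Return (indices, absolute errors) of elements that fail np.isclose.

    Hypothetical helper for debugging; it uses the same tolerance rule as
    np.allclose: |a - b| <= atol + rtol * |b|.
    """
    shap_sum = np.asarray(shap_sum, dtype=np.float32)
    margin = np.asarray(margin, dtype=np.float32)
    close = np.isclose(shap_sum, margin, rtol=rtol, atol=atol)
    bad = np.flatnonzero(~close)
    abs_err = np.abs(shap_sum - margin)
    return bad, abs_err[bad]


# Only the six values printed in the CI log; the elided rows are unknown.
shap_sum = np.array(
    [68.19425, 233.31032, 188.2829, 68.03375, 11.593885, 51.257465],
    dtype=np.float32,
)
margin = np.array(
    [68.19389, 233.31035, 188.28214, 68.03336, 11.594096, 51.25665],
    dtype=np.float32,
)

bad, err = report_mismatches(shap_sum, margin)
print("failing indices:", bad)
print("absolute errors:", err)
```

In the actual test, the same helper could be called with `np.sum(shap, axis=len(shap.shape) - 1)` and `margin` right before the assertion, so that a failing run reports which rows diverge and by how much instead of only `assert False`.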
