| Visible device: cuda |
| Seed used: 0 |
| Batch size: 64 |
| Epochs: 40 |
| Learning rate: 1e-05 |
| Entropy weight: 0.01 |
| Regularization weight: 0.0 |
| Only use multiwoz like domains: False |
| Vectorizer: Data set used is multiwoz21 |
| We filter state by active domains: True |
| Vectorizer: Data set used is multiwoz21 |
| Embedding semantic descriptions: True |
| Embedded descriptions successfully. Size: torch.Size([338, 768]) |
| Data set used for descriptions: multiwoz21 |
| We use Roberta to embed actions. |
| Didnt load a model |
| Start training |
| Epoch: 0 |
| Precision: 0 |
| Recall: 0 |
| F1: 0 |
| Best Precision: 0.0 |
| Best Recall: 0.0 |
| Best F1: 0.0 |
| Epoch: 1 |
| Precision: 0 |
| Recall: 0 |
| F1: 0 |
| Best Precision: 0.0 |
| Best Recall: 0.0 |
| Best F1: 0.0 |
| Epoch: 2 |
| Average actions: 2.4348959922790527 |
| Average target actions: 2.28125 |
| Precision: 0.043010752688172046 |
| Recall: 0.0425531914893617 |
| F1: 0.04278074866310161 |
| <<dialog policy>> epoch 2: saved network to mdl |
| Best Precision: 0.043010752688172046 |
| Best Recall: 0.0425531914893617 |
| Best F1: 0.04278074866310161 |
| Epoch: 3 |
| Precision: 0.043010752688172046 |
| Recall: 0.0425531914893617 |
| F1: 0.04278074866310161 |
| Best Precision: 0.043010752688172046 |
| Best Recall: 0.0425531914893617 |
| Best F1: 0.04278074866310161 |
| Epoch: 4 |
| Average actions: 2.4114584922790527 |
| Average target actions: 2.7890625 |
| Precision: 0.07058823529411765 |
| Recall: 0.06382978723404255 |
| F1: 0.06703910614525138 |
| <<dialog policy>> epoch 4: saved network to mdl |
| Best Precision: 0.07058823529411765 |
| Best Recall: 0.06382978723404255 |
| Best F1: 0.06703910614525138 |
| Epoch: 5 |
| Precision: 0.07058823529411765 |
| Recall: 0.06382978723404255 |
| F1: 0.06703910614525138 |
| Best Precision: 0.07058823529411765 |
| Best Recall: 0.06382978723404255 |
| Best F1: 0.06703910614525138 |
| Epoch: 6 |
| Average actions: 2.1536459922790527 |
| Average target actions: 2.5859375 |
| Precision: 0.049079754601226995 |
| Recall: 0.0425531914893617 |
| F1: 0.045584045584045586 |
| Best Precision: 0.07058823529411765 |
| Best Recall: 0.06382978723404255 |
| Best F1: 0.06703910614525138 |
| Epoch: 7 |
| Precision: 0.049079754601226995 |
| Recall: 0.0425531914893617 |
| F1: 0.045584045584045586 |
| Best Precision: 0.07058823529411765 |
| Best Recall: 0.06382978723404255 |
| Best F1: 0.06703910614525138 |
| Epoch: 8 |
| Average actions: 2.15625 |
| Average target actions: 2.5520834922790527 |
| Precision: 0.07547169811320754 |
| Recall: 0.06382978723404255 |
| F1: 0.06916426512968299 |
| <<dialog policy>> epoch 8: saved network to mdl |
| Best Precision: 0.07547169811320754 |
| Best Recall: 0.06382978723404255 |
| Best F1: 0.06916426512968299 |
| Epoch: 9 |
| Precision: 0.07547169811320754 |
| Recall: 0.06382978723404255 |
| F1: 0.06916426512968299 |
| Best Precision: 0.07547169811320754 |
| Best Recall: 0.06382978723404255 |
| Best F1: 0.06916426512968299 |
| Epoch: 10 |
| Average actions: 2.0572915077209473 |
| Average target actions: 2.3489584922790527 |
| Precision: 0.04516129032258064 |
| Recall: 0.03723404255319149 |
| F1: 0.04081632653061224 |
| Best Precision: 0.07547169811320754 |
| Best Recall: 0.06382978723404255 |
| Best F1: 0.06916426512968299 |
| Epoch: 11 |
| Precision: 0.04516129032258064 |
| Recall: 0.03723404255319149 |
| F1: 0.04081632653061224 |
| Best Precision: 0.07547169811320754 |
| Best Recall: 0.06382978723404255 |
| Best F1: 0.06916426512968299 |
| Epoch: 12 |
| Average actions: 1.984375 |
| Average target actions: 2.5520834922790527 |
| Precision: 0.08666666666666667 |
| Recall: 0.06914893617021277 |
| F1: 0.07692307692307691 |
| <<dialog policy>> epoch 12: saved network to mdl |
| Best Precision: 0.08666666666666667 |
| Best Recall: 0.06914893617021277 |
| Best F1: 0.07692307692307691 |
| Epoch: 13 |
| Precision: 0.08666666666666667 |
| Recall: 0.06914893617021277 |
| F1: 0.07692307692307691 |
| Best Precision: 0.08666666666666667 |
| Best Recall: 0.06914893617021277 |
| Best F1: 0.07692307692307691 |
| Epoch: 14 |
| Average actions: 2.0416665077209473 |
| Average target actions: 2.3828125 |
| Precision: 0.05228758169934641 |
| Recall: 0.0425531914893617 |
| F1: 0.046920821114369494 |
| Best Precision: 0.08666666666666667 |
| Best Recall: 0.06914893617021277 |
| Best F1: 0.07692307692307691 |
| Epoch: 15 |
| Precision: 0.05228758169934641 |
| Recall: 0.0425531914893617 |
| F1: 0.046920821114369494 |
| Best Precision: 0.08666666666666667 |
| Best Recall: 0.06914893617021277 |
| Best F1: 0.07692307692307691 |
| Epoch: 16 |
| Average actions: 2.1666665077209473 |
| Average target actions: 2.2135417461395264 |
| Precision: 0.1346153846153846 |
| Recall: 0.11170212765957446 |
| F1: 0.12209302325581395 |
| <<dialog policy>> epoch 16: saved network to mdl |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 17 |
| Precision: 0.1346153846153846 |
| Recall: 0.11170212765957446 |
| F1: 0.12209302325581395 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 18 |
| Average actions: 1.7734375 |
| Average target actions: 2.5520834922790527 |
| Precision: 0.0661764705882353 |
| Recall: 0.047872340425531915 |
| F1: 0.05555555555555556 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 19 |
| Precision: 0.0661764705882353 |
| Recall: 0.047872340425531915 |
| F1: 0.05555555555555556 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 20 |
| Average actions: 2.1328125 |
| Average target actions: 2.6197917461395264 |
| Precision: 0.1346153846153846 |
| Recall: 0.11170212765957446 |
| F1: 0.12209302325581395 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 21 |
| Precision: 0.1346153846153846 |
| Recall: 0.11170212765957446 |
| F1: 0.12209302325581395 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 22 |
| Average actions: 1.9296875 |
| Average target actions: 2.1119792461395264 |
| Precision: 0.08391608391608392 |
| Recall: 0.06382978723404255 |
| F1: 0.07250755287009063 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 23 |
| Precision: 0.08391608391608392 |
| Recall: 0.06382978723404255 |
| F1: 0.07250755287009063 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 24 |
| Average actions: 2.2213540077209473 |
| Average target actions: 2.3151042461395264 |
| Precision: 0.09815950920245399 |
| Recall: 0.0851063829787234 |
| F1: 0.09116809116809117 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 25 |
| Precision: 0.09815950920245399 |
| Recall: 0.0851063829787234 |
| F1: 0.09116809116809117 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 26 |
| Average actions: 2.1171875 |
| Average target actions: 2.7890625 |
| Precision: 0.12987012987012986 |
| Recall: 0.10638297872340426 |
| F1: 0.11695906432748537 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 27 |
| Precision: 0.12987012987012986 |
| Recall: 0.10638297872340426 |
| F1: 0.11695906432748537 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 28 |
| Average actions: 1.7734375 |
| Average target actions: 2.484375 |
| Precision: 0.08823529411764706 |
| Recall: 0.06382978723404255 |
| F1: 0.07407407407407407 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 29 |
| Precision: 0.08823529411764706 |
| Recall: 0.06382978723404255 |
| F1: 0.07407407407407407 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 30 |
| Average actions: 2.1822915077209473 |
| Average target actions: 2.3489584922790527 |
| Precision: 0.10126582278481013 |
| Recall: 0.0851063829787234 |
| F1: 0.09248554913294797 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 31 |
| Precision: 0.10126582278481013 |
| Recall: 0.0851063829787234 |
| F1: 0.09248554913294797 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 32 |
| Average actions: 2.0442707538604736 |
| Average target actions: 2.6197917461395264 |
| Precision: 0.12345679012345678 |
| Recall: 0.10638297872340426 |
| F1: 0.11428571428571428 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 33 |
| Precision: 0.12345679012345678 |
| Recall: 0.10638297872340426 |
| F1: 0.11428571428571428 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 34 |
| Average actions: 1.8307292461395264 |
| Average target actions: 2.5859375 |
| Precision: 0.11510791366906475 |
| Recall: 0.0851063829787234 |
| F1: 0.09785932721712538 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 35 |
| Precision: 0.11510791366906475 |
| Recall: 0.0851063829787234 |
| F1: 0.09785932721712538 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11170212765957446 |
| Best F1: 0.12209302325581395 |
| Epoch: 36 |
| Average actions: 2.2838540077209473 |
| Average target actions: 2.3489584922790527 |
| Precision: 0.1286549707602339 |
| Recall: 0.11702127659574468 |
| F1: 0.12256267409470752 |
| <<dialog policy>> epoch 36: saved network to mdl |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11702127659574468 |
| Best F1: 0.12256267409470752 |
| Epoch: 37 |
| Precision: 0.1286549707602339 |
| Recall: 0.11702127659574468 |
| F1: 0.12256267409470752 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11702127659574468 |
| Best F1: 0.12256267409470752 |
| Epoch: 38 |
| Average actions: 1.9479167461395264 |
| Average target actions: 2.7552084922790527 |
| Precision: 0.12337662337662338 |
| Recall: 0.10106382978723404 |
| F1: 0.1111111111111111 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11702127659574468 |
| Best F1: 0.12256267409470752 |
| Epoch: 39 |
| Precision: 0.12337662337662338 |
| Recall: 0.10106382978723404 |
| F1: 0.1111111111111111 |
| Best Precision: 0.1346153846153846 |
| Best Recall: 0.11702127659574468 |
| Best F1: 0.12256267409470752 |
|
|