You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I0828 22:17:48.339010 11510 solver.cpp:251] Iteration 0, Testing net (#0)
75
-
I0828 22:18:14.313817 11510 solver.cpp:302] Test net output #0: accuracy = 0.0416
76
-
I0828 22:18:14.476822 11510 solver.cpp:195] Iteration 0, loss = 3.75717
75
+
I0828 22:18:14.313817 11510 solver.cpp:302] Test net output #0: accuracy = 0.0308
76
+
I0828 22:18:14.476822 11510 solver.cpp:195] Iteration 0, loss = 3.78589
77
77
I0828 22:18:14.476878 11510 solver.cpp:397] Iteration 0, lr = 0.001
78
-
I0828 22:18:19.700408 11510 solver.cpp:195] Iteration 20, loss = 3.1689
78
+
I0828 22:18:19.700408 11510 solver.cpp:195] Iteration 20, loss = 3.25728
79
79
I0828 22:18:19.700461 11510 solver.cpp:397] Iteration 20, lr = 0.001
80
-
I0828 22:18:24.924685 11510 solver.cpp:195] Iteration 40, loss = 2.3549
80
+
I0828 22:18:24.924685 11510 solver.cpp:195] Iteration 40, loss = 2.18531
81
81
I0828 22:18:24.924741 11510 solver.cpp:397] Iteration 40, lr = 0.001
82
-
I0828 22:18:30.114858 11510 solver.cpp:195] Iteration 60, loss = 2.74191
82
+
I0828 22:18:30.114858 11510 solver.cpp:195] Iteration 60, loss = 2.4915
83
83
I0828 22:18:30.114910 11510 solver.cpp:397] Iteration 60, lr = 0.001
84
-
I0828 22:18:35.328071 11510 solver.cpp:195] Iteration 80, loss = 1.9147
84
+
I0828 22:18:35.328071 11510 solver.cpp:195] Iteration 80, loss = 2.04539
85
85
I0828 22:18:35.328127 11510 solver.cpp:397] Iteration 80, lr = 0.001
86
-
I0828 22:18:40.588317 11510 solver.cpp:195] Iteration 100, loss = 1.81419
86
+
I0828 22:18:40.588317 11510 solver.cpp:195] Iteration 100, loss = 2.1924
87
87
I0828 22:18:40.588373 11510 solver.cpp:397] Iteration 100, lr = 0.001
88
-
I0828 22:18:46.171576 11510 solver.cpp:195] Iteration 120, loss = 2.02105
88
+
I0828 22:18:46.171576 11510 solver.cpp:195] Iteration 120, loss = 2.25107
89
89
I0828 22:18:46.171669 11510 solver.cpp:397] Iteration 120, lr = 0.001
90
-
I0828 22:18:51.757809 11510 solver.cpp:195] Iteration 140, loss = 1.49083
90
+
I0828 22:18:51.757809 11510 solver.cpp:195] Iteration 140, loss = 1.355
91
91
I0828 22:18:51.757863 11510 solver.cpp:397] Iteration 140, lr = 0.001
92
-
I0828 22:18:57.345080 11510 solver.cpp:195] Iteration 160, loss = 1.35319
92
+
I0828 22:18:57.345080 11510 solver.cpp:195] Iteration 160, loss = 1.40815
93
93
I0828 22:18:57.345135 11510 solver.cpp:397] Iteration 160, lr = 0.001
94
-
I0828 22:19:02.928794 11510 solver.cpp:195] Iteration 180, loss = 1.11658
94
+
I0828 22:19:02.928794 11510 solver.cpp:195] Iteration 180, loss = 1.6558
95
95
I0828 22:19:02.928850 11510 solver.cpp:397] Iteration 180, lr = 0.001
96
-
I0828 22:19:08.514497 11510 solver.cpp:195] Iteration 200, loss = 1.08851
96
+
I0828 22:19:08.514497 11510 solver.cpp:195] Iteration 200, loss = 0.88126
97
97
I0828 22:19:08.514552 11510 solver.cpp:397] Iteration 200, lr = 0.001
98
98
99
99
[...]
100
100
101
-
I0828 22:22:40.789010 11510 solver.cpp:195] Iteration 960, loss = 0.0844627
101
+
I0828 22:22:40.789010 11510 solver.cpp:195] Iteration 960, loss = 0.112586
102
102
I0828 22:22:40.789175 11510 solver.cpp:397] Iteration 960, lr = 0.001
103
-
I0828 22:22:46.376626 11510 solver.cpp:195] Iteration 980, loss = 0.0110937
103
+
I0828 22:22:46.376626 11510 solver.cpp:195] Iteration 980, loss = 0.0959077
104
104
I0828 22:22:46.376682 11510 solver.cpp:397] Iteration 980, lr = 0.001
105
105
I0828 22:22:51.687258 11510 solver.cpp:251] Iteration 1000, Testing net (#0)
106
-
I0828 22:23:17.438894 11510 solver.cpp:302] Test net output #0: accuracy = 1
106
+
I0828 22:23:17.438894 11510 solver.cpp:302] Test net output #0: accuracy = 0.2356
107
107
108
-
Note how rapidly the loss went down. Although the 100% accuracy is optimistic, it is evidence the model is learning quickly and well.
108
+
Note how rapidly the loss went down. Although the 23.5% accuracy is only modest, it was achieved in only 1000, and evidence that the model is starting to learn quickly and well.
109
+
Once the model is fully fine-tuned on the whole training set over 100,000 iterations the final validation accuracy is 91.64%. This takes ~7 hours in Caffe on a K40 GPU.
109
110
110
111
For comparison, here is how the loss goes down when we do not start with a pre-trained model:
0 commit comments