How Does Mini-Batching Affect Curvature Information for Second Order Deep Learning Optimization