From: Research on predicting 2D-HP protein folding using reinforcement learning with full state space