On Non-Convex Splitting Methods For Markovian Information Theoretic Representation Learning