Text To Image Synthesis Via Mask Anchor Points And Aesthetic Assessment