The University of Information Technology.
Self-learning Reinforced with Human Feedback