Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture covers the development of dhSegment, an open-source package for document segmentation using PyTorch. Originally implemented in TensorFlow, dhSegment was ported to PyTorch to integrate state-of-the-art architectures more easily. The lecture discusses the challenges in automatic document analysis, the transition from TensorFlow to PyTorch, and the improvements made to dhSegment. Specific topics include image semantic segmentation, object recognition, text recognition, and analysis systems.