"Region Based Convolutional Neural Networks (R-CNN) are a family of machine learning models for computer vision and specifically object detection." -- Wikipedia
The page above mentions a number of packages available for using R-CNNs. For now, this page will concentrate on [1].