Posts

Showing posts from April, 2017
Parsing PDF to text using PowerShell

There are times when you come across an important PDF file (a low-level design document or something similar). The point is, I really hate PDF files... I was googling for a way to read PDF files using PowerShell. This can be helpful if you have lots of PDF files and you want to filter them based on a certain word or string (using a regex or something similar). I know I have been talking too much, so let's cut to the chase: I found this amazing .NET library which you can use to parse a PDF file (convert it into text). The library is called "iTextSharp"; you can find some information about it here: http://sourceforge.net/projects/itextsharp/?source=typ_redirect I assume that you already know what the hell I am talking about, so this is the code which I got from Stack Overflow: http://stackoverflow.com/questions/15684699/how-to-parse-pdf-content-to-database-with-pow
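For reference, here is a minimal sketch of the idea (not the Stack Overflow answer verbatim). It assumes the iTextSharp 5.x assembly and uses its `PdfReader` and `PdfTextExtractor` classes. The DLL path and the function names `Get-PdfText` and `Test-TextMatch` are made up for illustration, so adjust them to your setup.

```powershell
# Hypothetical location of the iTextSharp 5.x assembly; change it to your own path.
$dllPath = 'C:\Tools\itextsharp.dll'

# Load the library only if it is actually there, so the script fails later with a
# clear "type not found" error instead of dying on Add-Type.
if (Test-Path -LiteralPath $dllPath) {
    Add-Type -Path $dllPath
}

function Get-PdfText {
    # Returns the text of every page of one PDF as a single string.
    param(
        [Parameter(Mandatory = $true)] [string] $Path
    )
    # PdfReader expects a full path; Resolve-Path also errors out early on a missing file.
    $fullPath = (Resolve-Path -LiteralPath $Path).ProviderPath
    $reader = New-Object iTextSharp.text.pdf.PdfReader -ArgumentList $fullPath
    try {
        # Page numbers in iTextSharp start at 1.
        $pages = for ($page = 1; $page -le $reader.NumberOfPages; $page++) {
            [iTextSharp.text.pdf.parser.PdfTextExtractor]::GetTextFromPage($reader, $page)
        }
        $pages -join "`n"
    }
    finally {
        # Release the file handle even if extraction throws.
        $reader.Close()
    }
}

function Test-TextMatch {
    # True when the text matches the regex (-match is case-insensitive by default).
    param(
        [string] $Text,
        [string] $Pattern
    )
    return ($Text -match $Pattern)
}

# Example usage (folder and search word are placeholders):
# Get-ChildItem -Path 'C:\Docs' -Filter *.pdf |
#     Where-Object { Test-TextMatch -Text (Get-PdfText -Path $_.FullName) -Pattern 'firewall' }
```

Text extraction only works for PDFs that contain real text; scanned documents that are just page images will most likely come back empty.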