Ranter
Join devRant
Do all the things like
++ or -- rants, post your own rants, comment on others' rants and build your customized dev avatar
Sign Up
Pipeless API
From the creators of devRant, Pipeless lets you power real-time personalized recommendations and activity feeds using a simple API
Learn More
Comments
-
kpenc21328yIf you want to extract data from something, PDF is not the right format. :D It's not meant to be that thing.
-
yo-yo13638yMy guess is some stupid client asked for this extraction. And this guy wanted the money so he said he'll do it.
-
Heemers4938y@yo-yo I am doing a story on education and all the data I have is in the form of PDFs. I need to be able to play around with it and I can't do shit with a PDF that's why I need to have it on excel.
-
Heemers4938y@snakebyte that's how I got that data. You will be surprised how many organizations give journalists data on PDFs.. and we have the worst time trying to scrape that shit... and its not even properly structured... So I can't just have one code run through the whole document. I need to write a new script for every 5 pages or so... Ugh.!!!!!!!
Related Rants
Ugh.. why is it so hard to extract data from a god damned PDF and put it in a god damned excel. 😲
undefined
excel
python
pdf