python - Extracting data from txt files -


OK, GIT is using it from Bash. After running with me, I have securities and exchange commission db < code> txt file which is Edgar on my hard drive. I'm using Win 7. txt has the HTML tag in the files.

I was wondering because the files in the text are in this strict format. If the SEC agency has a way to remove certain things from the early nineties, then we say that

  & lt; us-gaap: iTaxxPensBenfit Reference = "eol_PE9523 ---- 1310-K0013_STD_365_20131231_0" decimal = "-3" id = "id_3914012_7 F3BEF88-8CD1-49E7-8A78-91A091178D1B_1_13" Unitof = "ISO 4217_USD" & gt; 40315000 & lt; / us-gaap: iCarac expensebnit & gt;   

What is strict with the use of a script or GIT repository after the format? For example, can anyone remove the hole table from the TXT file?

Can any of these come in the gate and do such a job? No work can be done, no library, guit, scripts can be picked up with some work and amendment. I read the instructions (whenever), but I do not understand many things.

This is not HTML it looks like XML - try to use XML parser for Python The tutorial is on their page, for example, and is parsing the relevant information.

Comments

Popular posts from this blog

Verilog Error: output or inout port "Q" must be connected to a structural net expression -

jasper reports - How to center align barcode using jasperreports and barcode4j -

c# - ASP.NET MVC - Attaching an entity of type 'MODELNAME' failed because another entity of the same type already has the same primary key value -