fume-manage-python.git

U  
¡ý°dã@sdZdZddlZddlmZddlmZddlZddlmZm    Z    ddl
mZddlZddl Z ddlZddlZddlZddlZddlZddlZdd    Zd#ddZGd ddeZddZdZdZd$ddZd%ddZd&ddZd'ddZd(d d!Zed"kreej  ¡dS))z=Diagnostic functions, mainly for use when doing tech support.ÚMITéN)ÚBytesIO)Ú
HTMLParser)Ú BeautifulSoupÚ__version__)Úbuilder_registryc
CsÀtdttdtjdddg}|D]4}tjD]}||jkr2q(q2| |¡td|q(d|krÆ| d¡z*dd    l    m
}td
d tt |j¡Wn*tk
rÄ}ztdW5d }~XYnXd|krzdd l}td|jWn,tk
r}ztdW5d }~XYnXt|dr,| ¡}|D]}td|d}zt||d}    d}Wn8tk
r}ztd|t ¡W5d }~XYnX|r°td|t|     ¡tdq0d S)z¼Diagnostic suite for isolating common problems.
 
    :param data: A string containing markup that needs to be explained.
    :return: None; diagnostics are printed to standard output.
    z'Diagnostic running on Beautiful Soup %szPython version %súhtml.parserÚhtml5libÚlxmlz;I noticed that %s is not installed. Installing it may help.zlxml-xmlr©ÚetreezFound lxml version %sÚ.z.lxml is not installed or couldn't be imported.NzFound html5lib version %sz2html5lib is not installed or couldn't be imported.Úreadz#Trying to parse your markup with %sF)ÚfeaturesTú%s could not parse the markup.z#Here's what %s did with the markup:zP--------------------------------------------------------------------------------)ÚprintrÚsysÚversionrZbuildersrÚremoveÚappendr
rÚjoinÚmapÚstrZLXML_VERSIONÚImportErrorr    ÚhasattrrrÚ    ExceptionÚ    tracebackÚ    print_excZprettify)
ÚdataZ basic_parsersÚnameZbuilderrÚer    ÚparserÚsuccessÚsoup©r$úCd:\z\workplace\vscode\pyvenv\venv\Lib\site-packages\bs4/diagnose.pyÚdiagnosesZ
 
 
 
ÿÿ
ÿ
ÿr&TcKspddlm}| dd¡}t|tr,| d¡}t|}|j|f||d|D]\}}td||j    |j
fqLdS)    a´Print out the lxml events that occur during parsing.
 
    This lets you see how lxml parses a document when no Beautiful
    Soup code is running. You can use this to determine whether
    an lxml-specific problem is in Beautiful Soup's lxml tree builders
    or in lxml itself.
 
    :param data: Some markup.
    :param html: If True, markup will be parsed with lxml's HTML parser.
       If False, lxml's XML parser will be used.
    rrÚrecoverTÚutf8)Úhtmlr'z%s, %4s, %sN)r
rÚpopÚ
isinstancerÚencoderÚ    iterparserÚtagÚtext)rr)Úkwargsrr'ÚreaderÚeventÚelementr$r$r%Ú
lxml_traceNs
 
ÿÿÿr4c@s`eZdZdZddZddZddZdd    Zd
dZdd Z    ddZ
ddZddZddZ dS)ÚAnnouncingParserzèSubclass of HTMLParser that announces parse events, without doing
    anything else.
 
    You can use this to get a picture of how html.parser sees a given
    document. The easiest way to do this is to call `htmlparser_trace`.
    cCst|dS)N)r)ÚselfÚsr$r$r%Ú_plszAnnouncingParser._pcCs| d|¡dS)Nz%s START©r8)r6rÚattrsr$r$r%Úhandle_starttagosz AnnouncingParser.handle_starttagcCs| d|¡dS)Nz%s ENDr9©r6rr$r$r%Ú handle_endtagrszAnnouncingParser.handle_endtagcCs| d|¡dS)Nz%s DATAr9©r6rr$r$r%Úhandle_datauszAnnouncingParser.handle_datacCs| d|¡dS)Nz
%s CHARREFr9r<r$r$r%Úhandle_charrefxszAnnouncingParser.handle_charrefcCs| d|¡dS)Nz%s ENTITYREFr9r<r$r$r%Úhandle_entityref{sz!AnnouncingParser.handle_entityrefcCs| d|¡dS)Nz
%s COMMENTr9r>r$r$r%Úhandle_comment~szAnnouncingParser.handle_commentcCs| d|¡dS)Nz%s DECLr9r>r$r$r%Úhandle_declszAnnouncingParser.handle_declcCs| d|¡dS)Nz%s UNKNOWN-DECLr9r>r$r$r%Úunknown_declszAnnouncingParser.unknown_declcCs| d|¡dS)Nz%s PIr9r>r$r$r%Ú    handle_piszAnnouncingParser.handle_piN)Ú__name__Ú
__module__Ú__qualname__Ú__doc__r8r;r=r?r@rArBrCrDrEr$r$r$r%r5dsr5cCst}| |¡dS)zÂPrint out the HTMLParser events that occur during parsing.
 
    This lets you see how HTMLParser parses a document when no
    Beautiful Soup code is running.
 
    :param data: Some markup.
    N)r5Úfeed)rr!r$r$r%Úhtmlparser_tracesrKZaeiouZbcdfghjklmnpqrstvwxyzécCs:d}t|D](}|ddkr"t}nt}|t |¡7}q|S)z#Generate a random word-like string.Úér)ÚrangeÚ_consonantsÚ_vowelsÚrandomÚchoice)Úlengthr7ÚiÚtr$r$r%ÚrwordsrWécCsd ddt|D¡S)z'Generate a random sentence-like string.ú css|]}tt dd¡VqdS)rXé    N)rWrRÚrandint)Ú.0rUr$r$r%Ú    <genexpr>¥szrsentence.<locals>.<genexpr>)rrO)rTr$r$r%Ú    rsentence£sr^éècCs¤dddddddg}g}t|D]r}t dd    ¡}|dkrPt |¡}| d
|¡q|dkrp| tt dd¡¡q|d krt |¡}| d|¡qdd |¡dS)z+Randomly generate an invalid HTML document.ÚpÚdivÚspanrUÚbÚscriptÚtableréz<%s>érXrNz</%s>z<html>Ú
z</html>)rOrRr[rSrr^r)Únum_elementsZ    tag_namesÚelementsrUrSZtag_namer$r$r%Úrdoc§s
 
rké c
Cs$tdtt|}tdt|dddgddfD]z}d}z"t ¡}t||}t ¡}d}Wn6tk
r}ztd    |t ¡W5d
}~XYnX|r4td|||fq4dd l    m
}t ¡}| |¡t ¡}td||dd
l}    |      ¡}t ¡}| |¡t ¡}td||d
S)z.Very basic head-to-head performance benchmark.z1Comparative parser benchmark on Beautiful Soup %sz3Generated a large invalid HTML document (%d bytes).r
r)r    rFTrNz"BS4+%s parsed the markup in %.2fs.rrz$Raw lxml parsed the markup in %.2fs.z(Raw html5lib parsed the markup in %.2fs.)rrrkÚlenÚtimerrrrr
rZHTMLr    rÚparse)
rirr!r"Úar#rcr rr    r$r$r%Úbenchmark_parsers¹s4
 
 
rqr
cCsXt ¡}|j}t|}tt||d}t d|||¡t     |¡}| 
d¡| dd¡dS)z7Use Python's profiler on a randomly generated document.)Úbs4rr!zbs4.BeautifulSoup(data, parser)Z
cumulativez _html5lib|bs4é2N)ÚtempfileÚNamedTemporaryFilerrkÚdictrrÚcProfileZrunctxÚpstatsZStatsZ
sort_statsZprint_stats)rir!Z
filehandleÚfilenamerÚvarsÚstatsr$r$r%ÚprofileÙs
 
r|Ú__main__)T)rL)rX)r_)rl)rlr
)!rIÚ__license__rwÚiorÚhtml.parserrrrrrZbs4.builderrÚosrxrRrtrnrrr&r4r5rKrQrPrWr^rkrqr|rFÚstdinrr$r$r$r%Ú<module>s88
&