fume-manage-python.git

"""
Module containing utilities for NDFrame.sample() and .GroupBy.sample()
"""
from __future__ import annotations

from typing import TYPE_CHECKING

import numpy as np

from pandas._libs import lib
from pandas._typing import AxisInt

from pandas.core.dtypes.generic import (
    ABCDataFrame,
    ABCSeries,
)

if TYPE_CHECKING:
    from pandas.core.generic import NDFrame


def preprocess_weights(obj: NDFrame, weights, axis: AxisInt) -> np.ndarray:
    """
    Process and validate the `weights` argument to `NDFrame.sample` and
    `.GroupBy.sample`.
 
    Returns `weights` as an ndarray[np.float64], validated except for normalizing
    weights (because that must be done groupwise in groupby sampling).
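    """
    # If weights is a Series, align it with the axis being sampled.
    if isinstance(weights, ABCSeries):
        weights = weights.reindex(obj.axes[axis])

    # A string is accepted only as a column name when sampling DataFrame rows.
    if isinstance(weights, str):
        if isinstance(obj, ABCDataFrame):
            if axis == 0:
                try:
                    weights = obj[weights]
                except KeyError as err:
                    raise KeyError(
                        "String passed to weights not a valid column"
                    ) from err
            else:
                raise ValueError(
                    "Strings can only be passed to weights when sampling "
                    "from rows on a DataFrame"
                )
        else:
            raise ValueError(
                "Strings cannot be passed as weights when sampling from a Series."
            )

    if isinstance(obj, ABCSeries):
        func = obj._constructor
    else:
        func = obj._constructor_sliced

    weights = func(weights, dtype="float64")._values

    if len(weights) != obj.shape[axis]:
        raise ValueError("Weights and axis to be sampled must be of same length")

    if lib.has_infs(weights):
        raise ValueError("weight vector may not include `inf` values")

    if (weights < 0).any():
        raise ValueError("weight vector may not include negative values")

    # Treat missing weights as zero, without modifying the caller's array.
    missing = np.isnan(weights)
    if missing.any():
        weights = weights.copy()
        weights[missing] = 0
    return weights


def process_sampling_size(
    n: int | None, frac: float | None, replace: bool
) -> int | None:
    """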
    Process and validate the `n` and `frac` arguments to `NDFrame.sample` and
    `.GroupBy.sample`.
 
    Returns None if `frac` should be used (variable sampling sizes), otherwise returns
    the constant sampling size.
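    """
    if n is None and frac is None:
        # Default to sampling a single row.
        n = 1
    elif n is not None and frac is not None:
        raise ValueError("Please enter a value for `frac` OR `n`, not both")
    elif n is not None:
        if n < 0:
            raise ValueError(
                "A negative number of rows requested. Please provide `n` >= 0."
            )
        if n % 1 != 0:
            raise ValueError("Only integers accepted as `n` values")
    else:
        assert frac is not None
        if frac > 1 and not replace:
            raise ValueError(
                "Replace has to be set to `True` when upsampling the "
                "population `frac` > 1."
            )
        if frac < 0:
            raise ValueError(
                "A negative number of rows requested. Please provide `frac` >= 0."
            )

    return n


def sample(
    obj_len: int,
    size: int,
    replace: bool,
    weights: np.ndarray | None,
    random_state: np.random.RandomState | np.random.Generator,
) -> np.ndarray:
    """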
    Randomly sample `size` indices in `np.arange(obj_len)`
 
    Parameters
    ----------
    obj_len : int
        The length of the indices being considered
    size : int
        The number of values to choose
    replace : bool
        Allow or disallow sampling of the same row more than once.
    weights : np.ndarray[np.float64] or None
        If None, every index is equally likely; otherwise indices are drawn
        with probability proportional to this vector, which is normalized
        to sum to 1 before sampling.
    random_state : np.random.RandomState or np.random.Generator
        State used for the random sampling
 
    Returns
    -------
    np.ndarray[np.intp]
    """
    if weights is not None:
        # Normalize the weights into probabilities; an all-zero vector is invalid.
        weight_sum = weights.sum()
        if weight_sum != 0:
            weights = weights / weight_sum
        else:
            raise ValueError("Invalid weights: weights sum to zero")

    return random_state.choice(obj_len, size=size, replace=replace, p=weights).astype(
        np.intp, copy=False
    )