Linux id-dci-web1412.main-hosting.eu 5.14.0-611.20.1.el9_7.x86_64 #1 SMP PREEMPT_DYNAMIC Wed Jan 14 06:35:04 EST 2026 x86_64
LiteSpeed
: 2a02:4780:6:1512:0:19fc:adf1:2 | : 216.73.216.85
Cant Read [ /etc/named.conf ]
8.1.34
u435990001
www.github.com/MadExploits
Terminal
AUTO ROOT
Adminer
Backdoor Destroyer
Linux Exploit
Lock Shell
Lock File
Create User
CREATE RDP
PHP Mailer
BACKCONNECT
UNLOCK SHELL
HASH IDENTIFIER
CPANEL RESET
CREATE WP USER
README
+ Create Folder
+ Create File
/
opt /
gsutil /
third_party /
chardet /
[ HOME SHELL ]
Name
Size
Permission
Action
.github
[ DIR ]
drwxr-xr-x
chardet
[ DIR ]
drwxr-xr-x
docs
[ DIR ]
drwxr-xr-x
tests
[ DIR ]
drwxr-xr-x
.git-blame-ignore-revs
84
B
-rw-r--r--
.gitattributes
14
B
-rw-r--r--
.gitignore
81
B
-rw-r--r--
.pre-commit-config.yaml
820
B
-rw-r--r--
.prospector.yaml
296
B
-rw-r--r--
LICENSE
25.91
KB
-rw-r--r--
MANIFEST.in
195
B
-rw-r--r--
NOTES.rst
3.68
KB
-rw-r--r--
README.rst
1.96
KB
-rw-r--r--
bench.py
4.79
KB
-rw-r--r--
convert_language_model.py
9.11
KB
-rw-r--r--
pyproject.toml
81
B
-rw-r--r--
setup.cfg
1.55
KB
-rw-r--r--
test.py
7.95
KB
-rw-r--r--
Delete
Unzip
Zip
${this.title}
Close
Code Editor : NOTES.rst
Class Hierarchy for chardet =========================== Universal Detector ------------------ Has a list of probers. CharSetProber ------------- Mostly abstract parent class. CharSetGroupProber ------------------ Runs a bunch of related probers at the same time and decides which is best. SBCSGroupProber --------------- SBCS = Single-ByteCharSet. Runs a bunch of SingleByteCharSetProbers. Always contains the same SingleByteCharSetProbers. SingleByteCharSetProber ----------------------- A CharSetProber that is used for detecting single-byte encodings by using a "precedence matrix" (i.e., a character bigram model). MBCSGroupProber --------------- Runs a bunch of MultiByteCharSetProbers. It also uses a UTF8Prober, which is essentially a MultiByteCharSetProber that only has a state machine. Always contains the same MultiByteCharSetProbers. MultiByteCharSetProber ---------------------- A CharSetProber that uses both a character unigram model (or "character distribution analysis") and an independent state machine for trying to detect and encoding. CodingStateMachine ------------------ Used for "coding scheme" detection, where we just look for either invalid byte sequences or sequences that only occur for that particular encoding. CharDistributionAnalysis ------------------------ Used for character unigram distribution encoding detection. Takes a mapping from characters to a "frequency order" (i.e., what frequency rank that byte has in the given encoding) and a "typical distribution ratio", which is the number of occurrences of the 512 most frequently used characters divided by the number of occurrences of the rest of the characters for a typical document. The "characters" in this case are 2-byte sequences and they are first converted to an "order" (name comes from ord() function, I believe). This "order" is used to index into the frequency order table to determine the frequency rank of that byte sequence. The reason this extra step is necessary is that the frequency rank table is language-specific (and not encoding-specific). What's where ============ Bigram files ------------ - ``hebrewprober.py`` - ``jpcntxprober.py`` - ``langbulgarianmodel.py`` - ``langcyrillicmodel.py`` - ``langgreekmodel.py`` - ``langhebrewmodel.py`` - ``langhungarianmodel.py`` - ``langthaimodel.py`` - ``latin1prober.py`` - ``sbcharsetprober.py`` - ``sbcsgroupprober.py`` Coding Scheme files ------------------- - ``escprober.py`` - ``escsm.py`` - ``utf8prober.py`` - ``codingstatemachine.py`` - ``mbcssmprober.py`` Unigram files ------------- - ``big5freqprober.py`` - ``chardistribution.py`` - ``euckrfreqprober.py`` - ``euctwfreqprober.py`` - ``gb2312freqprober.py`` - ``jisfreqprober.py`` Multibyte probers ----------------- - ``big5prober.py`` - ``cp949prober.py`` - ``eucjpprober.py`` - ``euckrprober.py`` - ``euctwprober.py`` - ``gb2312prober.py`` - ``mbcharsetprober.py`` - ``mbcsgroupprober.py`` - ``sjisprober.py`` Misc files ---------- - ``__init__.py`` (currently has ``detect`` function in it) - ``compat.py`` - ``enums.py`` - ``universaldetector.py`` - ``version.py`` Useful links ============ This is just a collection of information that I've found useful or thought might be useful in the future: - `BOM by Encoding`_ - `A Composite Approach to Language/Encoding Detection`_ - `What Every Programmer Absolutely...`_ - The actual `source`_ .. _BOM by Encoding: https://en.wikipedia.org/wiki/Byte_order_mark#Byte_order_marks_by_encoding .. _A Composite Approach to Language/Encoding Detection: http://www-archive.mozilla.org/projects/intl/UniversalCharsetDetection.html .. _What Every Programmer Absolutely...: http://kunststube.net/encoding/ .. _source: https://dxr.mozilla.org/mozilla/source/intl/chardet/
Close