I asked ChatGPT about some economic metrics, it gave me this table:
Category Archives: Data
Strawberry
Subversion @ GitHub
I wanted to use Subversion to checkout one of my GitHub repo branches, because an svn checkout only downloads the files it needs, not a full copy of every file ever added. But I discovered that GitHub sunset Subversion integration earlier this year. Sad face. Still, I suppose the economics justify that decision. As a consequence of my research, which was a bit sketchy because there is still heaps of documentation out there referring to the GitHub features which no longer exist, I did happen to learn about:
Data Analysis with Python for Excel Users – Full Course
Today I learned a little about Python, Jupyter, and Pandas in this video: Data Analysis with Python for Excel Users – Full Course.
Notation for ranges with included or excluded extremes
The square brackets are for a closed interval wherein endpoints are included.
The round brackets are for an open interval wherein endpoints are excluded.
Mail log IP address count
The following monster will parse the mail log and report on unique host connections along with a count.
cat /var/log/mail.log | \ grep ' connect from unknown' | \ awk '{ print $8 }' | \ sort | \ sed -n 's/.*\[\([0-9]\{1,3\}\.[0-9]\{1,3\}\.[0-9]\{1,3\}\.[0-9]\{1,3\}\)\].*/\1/p' | \ awk '{count[$1]++} END {for (word in count) print count[word], word}' | \ sort -n
Data formats including YAML
I finally got around to reading about YAML at Wikipedia. Worth doing if you use YAML anywhere and haven’t read the YAML page yet.
The Wikipedia article links to the official website: yaml.org, which is good fun. It’s written in YAML! :D
If you’re interested in data formats and markup languages here’s some other reading on the subject:
Relational Theory for Computer Professionals – C.J. Date
I was watching Relational Theory for Computer Professionals – C.J. Date and he made this fun remark:
There’s only three databases in the world: there’s departments and employees, there’s courses and students, and there’s suppliers and parts.
CRC
I watched Ben Eater’s videos How do CRCs work? and Checksums and Hamming distance (the full course is Learn about error detection) then found the Wikipedia page: Cyclic redundancy check.
Also of interest:
Keyspace on the back of an envelope
One question that always plagues me: should I use 32-bit or 64-bit keys. I can never make a firm decision on this point. But I’ve just done some back of the envelope calculations in thinking about it.
If you issued one identity per second, every second of every day, it would take 136 years to exhaust the keyspace.
If you issued one thousand identities per second, it would take 50 days to exhaust the keyspace.