[ home / rules / faq ] [ overboard / sfw / alt ] [ leftypol / siberia / edu / hobby / tech / games / anime / music / draw / AKM ] [ meta / roulette ] [ cytube / wiki / git ] [ GET / ref / marx / booru / zine ]

/tech/ - Technology

"Technology reveals the active relation of man to nature" - Karl Marx



File: 1685921309454-0.png (25.92 KB, 508x681, Capturebin1.png)

File: 1685921309454-1.png (29.7 KB, 1060x436, Capturebin3.png)

File: 1685921309454-2.png (17.1 KB, 221x592, Capturebin2.png)

 No.19811

(i have already posted this on other chans, i need all the help i can get)

so i'm planning to create an archive for threads and websites: threads and web writing that are important enough, have quality, and/or can be used to counter western media and history narratives.

the archive i want to create for threads is different from things like the internet archive, because i want to actually save every file that is uploaded. regular archives miss many of the files uploaded in a thread.

if i can, i want to make a website for this, but i have no experience with creating websites or coding. i also have special-ed mental issues that make me unable to learn coding like normal people, so it's hard.

my main plan is to use httrack: take every file format listed on wikipedia and other websites, then copy that list into httrack's file format selection field.

i want help from everyone here, so if you can, please send something.

also, i'm adhd brained, so if anyone can, please help me tidy the "list of file formats" page on wikipedia.

delete whatever description/writing is next to each format and make it a plain list like the pic i posted.

add +*. to the front of each entry:
+*.[INSERT FILE FORMAT]

https://en.wikipedia.org/wiki/List_of_file_formats
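
For the "add +*. to the front" step, a minimal sketch, assuming a Unix-like shell and input with one format per line. The function name add_prefix is made up for illustration, and whether httrack accepts every resulting rule is not verified here.

```shell
#!/bin/sh
# Hypothetical helper: read one extension per line on stdin and
# print one "+*.ext" rule per line.
add_prefix() {
    # trim surrounding whitespace, drop a leading dot if present,
    # skip blank lines, then put "+*." in front of what is left
    sed -e 's/^[[:space:]]*//' \
        -e 's/[[:space:]]*$//' \
        -e 's/^\.//' \
        -e '/^$/d' \
        -e 's/^/+*./'
}

# Demo with a few sample entries (one has a dot, one line is blank)
printf 'png\n.jpg\n\nwebm\n' | add_prefix
```

With a real list you would feed a file instead of the demo input, e.g. add_prefix < formats.txt > rules.txt (both file names are placeholders).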

 No.19812

so something like archived.moe but for /leftypol/?

 No.19814

>>19812
i don't use moe, but yes, like that. and you can have all the files that get posted

 No.19815

i like my internet content ephemeral

 No.19816

note: the main plan was to save the webpages offline on my computer. making a website was second

 No.19817

if anyone knows another chan/IB/forum that can help, please post

 No.19820

>>19817
Have you tried lainchan? They're more tech-centered.

 No.19821

I used to know people who were obsessed with this imageboard archival shit years and years ago
Literally all of this unpaid effort just to archive the worst slop on the internet even against the wishes of the userbase creating that content
They even tried to acquire special magnetic tape drives that would last longer
It was hilariously strange in retrospect, although at the time I was just interested in their knowledge on sysadmin'ing

 No.19823


 No.19824

>>19821
there are only two IBs like this, with content and quality like this. it should be archived

 No.19873

>>19814
you can see what files are allowed from the codebase, take meds and go to work
>>19816
archivebox is a thing btw, idk if it exactly suits your needs but it might be interesting to you
>>19821
fr
>>19811
anon why tf do you not just keep some form of personal knowledge organization, and when you see something cool you copy it down? Also stuff you read from anons on a forum should be taken with a grain of salt - many posts could be stripped down to some interesting research avenues, which could be what you actually save/write down

Keeping a [bunch of text] whole post only makes sense if you really appreciate the writing as a quote, for its special value in conveying something elegantly for example. Saving a [dialogue] thread makes pretty much no sense ever, even though in some rare cases it might have entertainment or educational value… it's so bulky, and for what?

 No.19878

>>19824
>quality like this
what are you talking about there are hundreds of low quality imageboards.

 No.19879

just fucking take screenshots of noteworthy things
please

 No.19897

>>19879
i know, but i want more, and screenshots have limitations

 No.19898


 No.19901

i dont want my posts archived forever by some bot

 No.19902

>>19901
i know

 No.20265

>>19901
uyghur they're already being archived and harvested by the glowies that run this shitsite

 No.20267

>>20265
thats nice dear

 No.20449

guys, i keep getting derailed by adhd and procrastination, how do i cope and finish this? i want to at least be done with the file format note thing

 No.20450

already tried the p3 thing in the gui httrack, it does not work

 No.20461

leftypol supported file types:
JPEG Files
BMP Files
GIF Files
PNG Files
MP3 Files
MP4 Files (Supports thumbnail)
WEBM Files (Supports thumbnail)
PDF Files (Supports thumbnail)
EPUB Files
DJVU Files (Supports thumbnail)
Text Files (Supports thumbnail)
ZIP Files
GZ Files
BZ2 Files

leftychan supported file types:
JPEG Files
BMP Files
GIF Files
PNG Files
MP3 Files
MP4 Files (Supports thumbnail)
WEBM Files (Supports thumbnail)
PDF Files (Supports thumbnail)
EPUB Files
DJVU Files (Supports thumbnail)
Text Files (Supports thumbnail)
ZIP Files
GZ Files
BZ2 Files

are these actually the only file formats that can be uploaded? or are there others that can be uploaded but are not on the list?

 No.20481

GUYS GUYS, I DID IT, THE NOTEPAD LIST IS 1/6 DONE !!!!! I ACTUALLY REMEMBERED TO DO IT !!!!

 No.20482

>>20481
wd comrade

 No.20483

>>20449
only one thing that can help you

a m p h e t a m i n e

 No.20577

THE NOTEPAD LIST, IT IS 2/6 DONE

 No.20600

UPDATE: PURE VERSION IS UP.
some entries do not have a dot (".") and some are capitalized, while others in the same place have dots and are not capitalized. can anyone here explain and help?

 No.20602

currently adding dots

 No.20627

>>20600
some look like file extensions, some look like MIME names.
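
If that's the case, a rough cleanup pass could look like this. It's only a sketch: it assumes one entry per line, treats anything containing a "/" as a MIME-style name and drops it, and lowercases everything so duplicates collapse. The name normalize_exts is made up, and lowercasing is a choice made here for deduplication, not something httrack is known to require.

```shell
#!/bin/sh
# Hypothetical helper: normalize a mixed list of extensions on stdin.
normalize_exts() {
    # 1. lowercase everything
    # 2. trim whitespace and drop a leading dot
    # 3. drop blank lines and MIME-like entries (anything with a "/")
    # 4. sort and remove duplicates
    tr '[:upper:]' '[:lower:]' |
        sed -e 's/^[[:space:]]*//' -e 's/[[:space:]]*$//' -e 's/^\.//' |
        grep -v -e '/' -e '^$' |
        sort -u
}

# Demo: same format written three ways, plus one MIME name
printf '.PNG\npng\nimage/png\nJPG\n' | normalize_exts
```

The output has no dots, so the "+*." prefix can be added afterwards in one consistent step.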

 No.20640

the wikipedia page i use (in case it gets updated, which would make it hard to make sure the list is perfect)

 No.20766

just in case:
checkpoint: clear text
line of the text (description line included)= 1773
last line: Pseudo-pipelines, Pseudo-pipeline

 No.20861

almost done.
im starting to think i have ocd. and adhd at the same time

 No.20878

IT IS DONE !
if anyone is still not sure, you can use winmerge or some other tool to compare the wiki txt file with this one (v4)

 No.20914

>>20878
oh god

 No.20917

i want to make a thread about website to archive, in what board should i put it ?

 No.21056

copypaste edition.
there are some that i miss.
NIfTI,z10-z99,cursor [edit].

its been hard to do simple things like this. i do not think having adhd is this hard. i may be the worst variant of adhd person group. AND I CANNOT DO ANYTHING ABOUT IT!!!.

 No.21057

>>21056
also this is the final fix (i think). later maybe i will post the previous version with the fix. post if there is something i missed

 No.21060

i forgot that you actually have to put it all on one horizontal line for it to work in httrack. anyone know a program or script that can help? anyone wanna help?
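
One way to do the "horizontal line" part, as a sketch only: it assumes a Unix-like shell, one rule per line as input, and that spaces are an acceptable separator between rules in httrack (taken from the post above, not checked). join_rules is a made-up name.

```shell
#!/bin/sh
# Hypothetical helper: join every line from stdin into a single line.
join_rules() {
    # paste -s merges all input lines into one line;
    # -d ' ' puts a single space between them
    paste -s -d ' ' -
}

# Demo with three rules
printf '+*.png\n+*.jpg\n+*.webm\n' | join_rules
```

For a real list: join_rules < rules.txt > rules_one_line.txt (placeholder file names), then copy the contents of the second file into httrack.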

 No.21063

>>20461
maybe just archiving iB thread using the faq supported file types is fine…..

 No.21105

from kate gang(???) leftychan.org

 No.21107

>>21105
*.net

 No.21110

I may be an idiot, but… WHAT THE FUCK IS THIS THREAD ABOUT??

OP hasn't archived shit, it's just text files with filetype lists. He spent a month putting a dot in front of extensions he took from Wikipedia.
>>21105
>from kate gang(???) leftychan.org
it's filetypes, one of which is .apk, I didn't know leftychan.org hosted APKs.

 No.21113

>>21110
the filetype list txt files are for the archiving tool,
to copy paste into httrack.
that's why i want a list of every kind of file, taken straight from wikipedia.

this thread is the creation of my autismretardationspecialeducationadhdbrain.
that's why the thread looks derailed.

 No.21115

>>21113
>autismretardationspecialeducationadhdbrain
but anon, I have autismretardationadhdbrain and that's why I hate things that are complicated. Now that I understand what you want to do, I can tell you you could have used a single wget line.

wget -mpckE --user-agent="" -e robots=off --wait 1 www.foo.com

Explanation: https://dheinemann.com/posts/2022-02-05-archiving-a-website-with-wget

Then with { } and the && operator (or a shell script) you can download several websites in one go, e.g.
#!/bin/sh
# mirror every URL given on the command line, not just the first one
wget -mpckE --user-agent="" -e robots=off --wait 1 "$@"
echo "done"

save as archive.sh, chmod +x it and then in terminal:
$ ./archive.sh www.leftypol.{org,net}


wget will automatically save all pages with a .html extension, make the links relative, and you will have an offline mirror of the website.

 No.21186

>>21115
woah. i will use this

 No.21187

>>21115
based wget guru
it blew my mind how good it was for downloading multiple issues of periodicals from libgen


Unique IPs: 10
