No.19811
(i have already made this in other chans, i need all the help i can get)
so i'm planning to create an archive for threads and websites: threads and web writing that are important enough, have quality, and/or can be used to counter western media and history narratives.
the archive i want to create for the threads is different from things like the internet archive, because i want to actually save every file that is uploaded. regular archives miss many of the files uploaded in a thread.
if i can, i want to make a website for this, but i have no experience with creating websites or coding. i also have special-ed brain that makes me unable to learn coding like normal people, so it's hard.
my main plan is to use httrack with every file format listed on wikipedia and other websites, then copy that list into httrack's file format selection thing
i want help from everyone here, so if you can, please send something
also adhd brained, so if anyone can, please help me tidy the "list of file formats" list on wikipedia.
delete whatever description/writing is next to each entry and make it a plain list like the pic i posted.
add +*. to the front.
+*.[INSERT FILE FORMAT]
https://en.wikipedia.org/wiki/List_of_file_formats
No.19812
so something like archived.moe but for /leftypol/?
No.19814
>>19812
i don't use moe, but yes, like that. and you can have all the files that get posted
No.19815
i like my internet content ephemeral
No.19816
note: the main plan was to save the webpages offline on my computer. making a website was the second
No.19817
if anyone knows another chan/IB/forum that can help, please post
No.19820
>>19817
Have you tried lainchan? They're more tech-centered.
No.19821
I used to know people who were obsessed with this imageboard archival shit years and years ago
Literally all of this unpaid effort just to archive the worst slop on the internet even against the wishes of the userbase creating that content
They even tried to acquire special magnetic tape drives that would last longer
It was hilariously strange in retrospect, although at the time I was just interested in their knowledge on sysadmin'ing
No.19824
>>19821
there are only two IBs like this, with content and quality like this. they should be archived
No.19873
>>19814
you can see what files are allowed from the codebase, take meds and go to work
>>19816
archivebox is a thing btw, idk if it exactly suits your needs but it might be interesting to you
>>19821
fr
>>19811
anon why tf do you not just keep some form of personal knowledge organization, and when you see something cool you copy it down? Also stuff you read from anons on a forum should be taken with a grain of salt - many posts could be stripped down to some interesting research avenues, which could be what you actually save/write down
Keeping a [bunch of text] whole post only makes sense if you really appreciate the writing as a quote, for its special value in conveying something elegantly for example. Saving a [dialogue] thread makes pretty much no sense ever, even though in some rare cases it might have entertainment or educational value… it's so bulky, and for what?
No.19878
>>19824
>quality like this
what are you talking about? there are hundreds of low quality imageboards.
No.19879
just fucking take screenshots of noteworthy things
please
No.19897
>>19879
i know, but i want more, and screenshots have limitations
No.19901
i dont want my posts archived forever by some bot
No.20265
>>19901
uyghur they're already being archived and harvested by the glowies that run this shitsite
No.20449
guys, i keep getting derailed by adhd and procrastination, how do i cope and finish this? i want to at least be done with the file format note thing
No.20450
already tried the p3 thing in the gui httrack, it does not work
No.20461
leftypol supported file types:
JPEG Files
BMP Files
GIF Files
PNG Files
MP3 Files
MP4 Files (Supports thumbnail)
WEBM Files (Supports thumbnail)
PDF Files (Supports thumbnail)
EPUB Files
DJVU Files (Supports thumbnail)
Text Files (Supports thumbnail)
ZIP Files
GZ Files
BZ2 Files
leftychan supported file types:
JPEG Files
BMP Files
GIF Files
PNG Files
MP3 Files
MP4 Files (Supports thumbnail)
WEBM Files (Supports thumbnail)
PDF Files (Supports thumbnail)
EPUB Files
DJVU Files (Supports thumbnail)
Text Files (Supports thumbnail)
ZIP Files
GZ Files
BZ2 Files
are these actually the only file formats that can be uploaded? or are there others that can be uploaded but are not on the list?
No.20481
GUYS GUYS, I DID IT, THE NOTEPAD LIST IS 1/6 DONE !!!!! I ACTUALLY REMEMBERED TO DO IT !!!!
No.20483
>>20449
only one thing that can help you
a m p h e t a m i n e
No.20577
THE NOTEPAD LIST, IT IS 2/6 DONE
No.20602
currently adding dots
No.20627
>>20600
some look like file extensions, some look like MIME names.
No.20766
just in case:
checkpoint: clear text
line of the text (description line included)= 1773
last line: Pseudo-pipelines, Pseudo-pipeline
No.20917
i want to make a thread about websites to archive, which board should i put it in?
No.21057
>>21056
also this is the final fix (i think). later maybe i will post the previous version with the fix. post if there is something i missed
No.21060
i forgot that you actually have to put it all on one horizontal line for it to work in httrack. anyone know a program or script that can help? anyone wanna help?
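The one-line conversion asked about here can be sketched in plain sh. This is only a minimal sketch, assuming the list has one extension per line (with or without a leading dot) and that the filters should end up space-separated on a single line, as the post describes. to_filters is a made-up helper name, not part of httrack.

```shell
#!/bin/sh
# to_filters: hypothetical helper (not an httrack feature).
# Reads one file extension per line on stdin and prints a single
# line of "+*.ext" patterns separated by spaces.
to_filters() {
    # 1. drop Windows carriage returns (Notepad saves lines as CRLF)
    # 2. trim surrounding whitespace and one leading dot
    # 3. skip blank lines, prefix "+*." and join with spaces
    tr -d '\r' |
    sed -e 's/^[[:space:]]*//' -e 's/[[:space:]]*$//' -e 's/^\.//' |
    awk 'NF { printf "%s+*.%s", sep, $0; sep = " " } END { print "" }'
}

# Example with a few extensions; the blank entry is skipped.
printf '%s\n' jpg .png '' webm pdf | to_filters
# prints: +*.jpg +*.png +*.webm +*.pdf
```

With the list saved in a text file (the name extensions.txt is just an example), `to_filters < extensions.txt` prints the line to copy.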
No.21110
I may be an idiot, but… WHAT THE FUCK IS THIS THREAD ABOUT??
OP hasn't archived shit, it's just text files with filetype lists. He spent a month putting a dot in front of extensions he took from Wikipedia.
>>21105
>from kate gang(???) leftychan.org
it's filetypes, one of which is .apk, I didn't know leftychan.org hosted APKs.
No.21113
>>21110
the filetype list txt files are for the archiving tool.
to copy paste into httrack.
hence why i want a list of every kind of file, taken straight out of wikipedia.
this thread is the creation of my autismretardationspecialeducationadhdbrain.
hence why thread is looking derailed.
No.21115
>>21113
>autismretardationspecialeducationadhdbrain
but anon, I have autismretardationadhdbrain and that's why I hate things that are complicated. Now that I understand what you want to do, I can tell you you could have used a single wget line.
wget -mpckE --user-agent="" -e robots=off --wait 1 www.foo.com
Explanation:
https://dheinemann.com/posts/2022-02-05-archiving-a-website-with-wget
Then with { } and the && operator (or a shell script) you can download all kinds of websites, e.g.
#!/bin/sh
# "$@" passes every URL given on the command line to wget, not just the first
wget -mpckE --user-agent="" -e robots=off --wait 1 "$@"
echo "done"
save as archive.sh, chmod +x it and then in terminal:
$ ./archive.sh www.leftypol.{org,net}
wget will automatically convert all pages to .html, make the links relative, and you will have an offline mirror of the website.
No.21186
>>21115
woah. i will use this
No.21187
>>21115
based wget guru
it blew my mind how good it was for downloading multiple issues of periodicals from libgen