A Dataset of AI-generated podcasts by @jilltxt

https://dataverse.no/dataset.xhtml?persistentId=doi:10.18710/RNTF9H

Certainly a new way of looking at your digital content -- especially the blank Untitled.pdf ;)

#DH #DigitalHumanities #Dataset #ContentAnalysis #Authenticity #LLM #GenAI #WTFPDF

A dataset of AI-generated podcasts created using NotebookLM

Audio files and transcripts of AI-generated podcasts created using Google's NotebookLM. Four podcasts were created by uploading specific PDFs. Twen...

DataverseNO

Porting SafeText and analyzing digital content with Apache Tika

by @beet_keeper

Last year I wrote about pitfalls in modern journalism, especially with regard to receiving documents and information from whistleblowers without offering them adequate protection.

The tl;dr is that you, as a whistleblower, need to protect yourself; and you, as an editor or journalist, need to protect your whistleblowers.

Steganographic fingerprints are one method that might be used to detect who is leaking information. Steganographic characters replace common textual characters with unusual but hard-to-detect variants, i.e. characters that look the same to the human eye or are actually invisible. Using a tool called SafeText by David Jacobson, we can identify these hidden fingerprints in the content that you share.
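To make the idea concrete, here is a minimal Go sketch of what detecting such characters can look like. It is not SafeText's implementation: the `lookalikes` table, the `Finding` type and the `Scan` function are hypothetical names made up for this illustration, and the table covers only a handful of characters.

```go
package main

import (
	"fmt"
	"unicode"
)

// Finding records one suspicious rune and where it was seen.
type Finding struct {
	Offset int // byte offset in the input string
	Rune   rune
	Reason string
}

// lookalikes is a tiny, hand-picked table (an assumption for this sketch,
// not SafeText's list) mapping unusual characters to the ordinary
// characters they imitate.
var lookalikes = map[rune]rune{
	'\u00A0': ' ', // no-break space
	'\u2009': ' ', // thin space
	'\u202F': ' ', // narrow no-break space
	'\u0430': 'a', // Cyrillic small a
	'\u0435': 'e', // Cyrillic small ie
	'\u043E': 'o', // Cyrillic small o
}

// Scan reports invisible format characters (Unicode category Cf, such as
// the zero-width space U+200B) and any rune listed in the lookalikes table.
func Scan(text string) []Finding {
	var out []Finding
	for i, r := range text {
		if unicode.Is(unicode.Cf, r) {
			out = append(out, Finding{i, r, "invisible format character"})
			continue
		}
		if plain, ok := lookalikes[r]; ok {
			out = append(out, Finding{i, r, fmt.Sprintf("looks like %q", plain)})
		}
	}
	return out
}

func main() {
	// The sample hides a Cyrillic "o", a zero-width space and a no-break space.
	sample := "Meet at n\u043Eon\u200B in the\u00A0lobby."
	for _, f := range Scan(sample) {
		fmt.Printf("offset %d: U+%04X (%s)\n", f.Offset, f.Rune, f.Reason)
	}
}
```

A real tool would need a far larger lookalike table and a way to avoid flagging legitimate non-Latin text, which this sketch does not attempt.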

I firmly believe we can find clues about what is important to preserve, or learn to preserve, when we analyse the content of the digital record and not just the (file) format of the digital record.

A file can contain many different features, and each of these is a challenge to the file's future interpretation, and thus its preservation.

I wanted to use SafeText in some of my other non-Python tooling and so I decided to port the code to Golang as a composable module and binary.

By coincidence, when I started writing this I had just written about revisiting tikalinkextract, so I thought I would write this short explanation of how you might combine Tika and SafeText to perform some content analysis of your own.

Who knows, maybe we will find a conspiracy. Maybe we’ll find secret codes in our own digital records. Maybe we’ll learn something new about our records…

Let's have a look at putting Tika and SafeText together and see where it goes.


#ApacheTika #authenticity #Code #Coding #ContentAnalysis #Data #DigitalHumanities #digitalLiteracy #DigitalPreservation #Golang #integrity #Journalism #Metadata #Paradata #SafeText #steganography #Whistleblow #Whistleblower

First project assignment: comparing cleaning-product advertisements across countries to analyze consumer behavior and culture. The author needs feedback on the content analysis method and on the data coding done with ChatGPT. The project will be used to build a portfolio for marketing job applications. Community help is needed to avoid "analysis paralysis"! #Marketing #NghiênCứu #DựÁnĐầuTay #ConsumerBehavior #ContentAnalysis #MarketingResearch #VietnamBusiness #ThịTrườngTiêuDùng #StartupProject

https://www.reddit.com/r/SideProject/comments/1qt2e9

Spent 8 months building an SEO tool with an integrated heatmap that tracks reader behavior, called Rytar. The goal: to know exactly where readers stop, what they skip, and which parts of an article they interact with. Coded it myself, no marketing, and only 12 users so far. If you work in SEO or content and want to try it, you can use it for free to evaluate it. Is it actually useful, or just an "accidental" product of self-delusion?

#Rytar #SEO #ContentMarketing #SideProject #Feedback #CôngCụSEO #PhátTriểnSảnPhẩm #StartUp #ContentAnalysis #Heatmap #ViếtBlog

New tutorial: Discover hidden themes in your writing with Quarkus + DeepLearning4j.
We scrape, embed, and cluster Substack articles. Showing how Java can power AI content analysis.
https://www.the-main-thread.com/p/quarkus-deeplearning4j-java-substack-clustering

#Java #Quarkus #AI #DeepLearning4j #ContentAnalysis

The ContentAnalysis.ai tool helps you quickly assess content quality and SEO! 🚀

✅ Assesses readability, structure, and E-E-A-T signals (experience, expertise, authoritativeness, and trustworthiness).
✅ Checks on-page SEO (title, meta, headings, internal links...).
✅ Suggests edits to improve the content.
✅ Assigns a score so you can track improvement.

#SEO #ContentMarketing #AI #ContentAnalysis #Marketing #NộiDung #TiếpThịNộiDung #CôngCụSEO #AI

https://www.reddit.com/r/SideProject/comments/1nfrjfz/building_contentanal

I’m truly grateful my article "Too Cute to Be a Crime? AI-Generated Lolita Aesthetics and the Legal Limits of Synthetic Girlhood on TikTok" has been published in the International Journal for Crime, Law and AI.

I hope that this will be a small step towards understanding how technology reshapes law and culture.
https://www.academia.edu/130116050/Too_Cute_to_Be_a_Crime_AI_Generated_Lolita_Aesthetics_and_the_Legal_Limits_of_Synthetic_Girlhood_on_TikTok

#AIResearch #DigitalCulture #TikTok #LolitaAesthetics #SyntheticMedia #LawAndTech #MediaStudies #AIandLaw #ContentAnalysis #Lolita #AI #MediaScholar #SocialMedia

Dribble the News: 3 Methods to Use News Content With AI | HackerNoon

Discover 3 innovative methods to curate and analyze news using AI, from freeform exploration to satire. Perfect for journalists, analysts and marketers.

Which tools do you use for quantitative content analysis or annotation tasks that are simpler than MAXQDA or Inception?

I am looking for a tool that can display a long text, such as a newspaper article, in a nice, readable way, along with some metadata about the text and a very basic scheme of categories.

#commscholars #annotation #contentanalysis

🐭 The Gamer and the Nihilist: An analysis of Product Hunt, 2014 - 2021

(... finally the send-up Product Hunt deserves!)

https://components.one/posts/gamer-and-nihilist-product-hunt

#games #productivity #business #startups #producthunt #marketing #culture #contentanalysis
