| Author |
program to download archived google groups posts by author
|
|
| Nospam 2005-09-10, 7:29 pm |
| I am looking for a program that can extract the posts of a particular
author on Google Groups and save them in one text file, ideally without any
replies. Is such a program available?
Any help will be greatly appreciated.
| |
| Paul N Burke 2005-09-10, 7:29 pm |
| On Sat, 10 Sep 2005 13:55:44 GMT, "Nospam" <nospam@home.com> wrote:
>I am looking for a program out there that can extract posts of a particular
>author on google groups and save them in one txt file hopefully without any
>replies, is such a prog available?
>
>any help will be greatly appreciated
>
>
>
Do you have the author's permission to do that?
----== Posted via codecomments.com - Unlimited-Uncensored-Secure Usenet News==----
http://www.codecomments.com The #1 Newsgroup Service in the World! 120,000+ Newsgroups
----= East and West-Coast Server Farms - Total Privacy via Encryption =----
| |
|
|
|
|
| Leif K-Brooks 2005-09-10, 7:29 pm |
| Barbara de Zoete wrote:
> Besides this being off-topic: Does Google have the author's permission
> [to archive their Usenet messages]?
Implicit permission, yes.
| |
|
|
| John Bokma 2005-09-10, 7:29 pm |
| "Barbara de Zoete" <b_de_zoete@hotmail.com> wrote:
> On Sat, 10 Sep 2005 15:04:11 GMT, Leif K-Brooks
> <eurleif@ecritters.biz> wrote:
>
>
> To archive, although I'm never too fond of 'opt-out'. To archive. Not
> to reuse in some goofy interface such as Google Groups, as if it was
> the next great Google invention.
It isn't. It was Deja News. And what's the point of an archive if you can't
access it? Moreover, are there restrictions on how one makes an interface
to an archive?
--
John PERL SEO tools: http://johnbokma.com/perl/
Experienced (web) developer: http://castleamber.com/
Get a SEO report of your site for just 100 USD:
http://johnbokma.com/websitedesign/seo-expert-help.html
| |
| Toby Inkster 2005-09-10, 7:29 pm |
| Barbara de Zoete wrote:
> Does Google have the author's permission?
> [to archive posts]
Google creates a copy of your post on their server; so do thousands of
other servers. The other servers only keep their copies for a week or so;
Google keeps their copy for a lot longer.
Length of retention is the only difference between Google and any other NNTP
servers; length of retention does not factor into copyright law AFAIK.
So if Google's copies are illegal, so are the copies on every other
server. (And that is quite possible!)
--
Toby A Inkster BSc (Hons) ARCS
Contact Me ~ http://tobyinkster.co.uk/contact
Now Playing ~ ./tegan_and_sara/if_it_was_you/02_you_went_away.ogg
| |
| Big Bill 2005-09-10, 7:29 pm |
| On Sat, 10 Sep 2005 19:15:57 +0200, "Barbara de Zoete"
<b_de_zoete@hotmail.com> wrote:
>On Sat, 10 Sep 2005 15:04:11 GMT, Leif K-Brooks <eurleif@ecritters.biz> wrote:
>
>
>To archive, although I'm never too fond of 'opt-out'. To archive. Not to reuse in
>some goofy interface such as Google Groups, as if it was the next great Google
>invention.
In an inaccessible archive, then? Who'd know?
BB
--
www.kruse.co.uk/ seo@kruse.demon.co.uk
Elvis does my SEO
--
| |
| Barbara de Zoete 2005-09-10, 7:29 pm |
| On Sat, 10 Sep 2005 19:26:30 +0100, Toby Inkster
<usenet200509@tobyinkster.co.uk> wrote:
> Barbara de Zoete wrote:
>
>
> Google creates a copy of your post on their server;
If it stops there, fine. But it then uses the copy in some fancy but buggy
interface it calls Google Groups and encourages people to become a 'member' to
participate in them. That part is quite different. I am just not sure I like
anyone using my posts for something interactive like that. Archive, searchable,
the like. Great. Reuse them for another purpose and you're slipping and sliding
somewhere not necessarily good.
--
,-- --<--@ -- PretLetters: 'woest wyf', met vele interesses: ----------.
| weblog | http://home.wanadoo.nl/b.de.zoete/_private/weblog.html |
| webontwerp | http://home.wanadoo.nl/b.de.zoete/html/webontwerp.html |
|zweefvliegen | http://home.wanadoo.nl/b.de.zoete/html/vliegen.html |
`-------------------------------------------------- --<--@ ------------'
| |
| Barbara de Zoete 2005-09-10, 7:29 pm |
| On Sat, 10 Sep 2005 19:21:26 GMT, Big Bill <kruse@cityscape.co.uk> wrote:
> On Sat, 10 Sep 2005 19:15:57 +0200, "Barbara de Zoete"
> <b_de_zoete@hotmail.com> wrote:
>
>
> In an inaccessible archive, then? Who'd know?
>
No, just an archive will be fine. One can search through an archive all one
wants, just fine. Not communicate through it though. That is a big difference.
> --
Your sig-separator is broken.
--
,-- --<--@ -- PretLetters: 'woest wyf', met vele interesses: ----------.
| weblog | http://home.wanadoo.nl/b.de.zoete/_private/weblog.html |
| webontwerp | http://home.wanadoo.nl/b.de.zoete/html/webontwerp.html |
|zweefvliegen | http://home.wanadoo.nl/b.de.zoete/html/vliegen.html |
`-------------------------------------------------- --<--@ ------------'
| |
| David Dorward 2005-09-10, 7:29 pm |
| Barbara de Zoete wrote:
> If it stops there, fine. But it then uses the copy in some fancy but buggy
> interface it calls Google Groups and encourages people to become 'member'
> to participate in them.
Demon (and many others) encourage people to send them money to become
"customers" to participate in them.
--
David Dorward <http://blog.dorward.me.uk/> <http://dorward.me.uk/>
Home is where the ~/.bashrc is
| |
|
| In news:opswwjjmi0x5vgts@zoete_b,
Barbara de Zoete <b_de_zoete@hotmail.com> wrote:
> On Sat, 10 Sep 2005 19:26:30 +0100, Toby Inkster
> <usenet200509@tobyinkster.co.uk> wrote:
>
>
> If it stops there, fine. But it then uses the copy in some fancy but
> buggy interface it calls Google Groups and encourages people to
> become 'member' to participate in them. That part is quite different.
> I am just not sure I like anyone using my posts for something
> interactive like that.
I don't understand the last part of the paragraph above. If you don't want
interaction, why post publicly at all?
If it's just newsgroup interaction with incoming posts from Google Groups
that annoys you, there are many easily implemented ways around that.
> Archive, searchable, the like. Great. Reuse
> them for another purpose and you're slipping and sliding somewhere
> not necessarily good.
I still don't think I understand. Are you saying you don't mind if your
posts get archived, so long as nobody can respond to them? Or just that
nobody who sees them via Google Groups need reply?
| |
| Barbara de Zoete 2005-09-10, 7:29 pm |
| On Sat, 10 Sep 2005 16:02:45 -0400, TJ <none@noemail.invalid> wrote:
[Google Groups]
> I still don't think I understand. Are you saying you don't mind if your
> posts get archived, so long as nobody can respond to them? Or just that
> nobody who sees them via Google Groups need reply?
>
I don't like the 'get to be a member and participate' bit of Google Groups. You
don't need to be a member to participate in usenet. I know I'm not. Where did
Google get the idea that they can use content created by _me_ and you and others
to create a member database they call their own? That's no good.
--
,-- --<--@ -- PretLetters: 'woest wyf', met vele interesses: ----------.
| weblog | http://home.wanadoo.nl/b.de.zoete/_private/weblog.html |
| webontwerp | http://home.wanadoo.nl/b.de.zoete/html/webontwerp.html |
|zweefvliegen | http://home.wanadoo.nl/b.de.zoete/html/vliegen.html |
`-------------------------------------------------- --<--@ ------------'
| |
| Charles Sweeney 2005-09-10, 7:29 pm |
| Paul N Burke wrote
> On Sat, 10 Sep 2005 13:55:44 GMT, "Nospam" <nospam@home.com> wrote:
>
> Have you the author's permission to do that ?
I wouldn't have thought he would have needed the author's permission.
Does Google have your permission to archive your posts?
--
Charles Sweeney
http://CharlesSweeney.com
| |
|
| In news:opswwkuxxyx5vgts@zoete_b,
Barbara de Zoete <b_de_zoete@hotmail.com> typed:
> On Sat, 10 Sep 2005 16:02:45 -0400, TJ <none@noemail.invalid> wrote:
>
> [Google Groups]
>
>
> I don't like the 'get to be a member and participate' bit of Google
> Groups.
Why? You personally are either a "member" of an ISP that includes Usenet
newsgroups as part of your service, or you subscribe to an outside source.
> You don't need to be a member to participate in usenet.
Sure ya do. Be it ISP, news provider, or Google Groups, one is required to
be a 'member' to participate.
> I
> know I'm not. Where did Google get the idea that they can use
> content created by _me_ and you and others to create a member
> database they call their own? That's no good.
Ever heard of the X-No-Archive header? I think Google still honors that,
though it's of no use if someone includes your entire post in a reply.
Please fix your signature.
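
For reference, the header mentioned above is conventionally written "X-No-Archive: yes". A minimal sketch of checking a saved article for it, assuming the article is available as raw text with its headers intact (the function name and the sample message are made up for illustration):

```python
from email import message_from_string


def is_no_archive(raw_message: str) -> bool:
    """Return True if the article carries an 'X-No-Archive: yes' header."""
    msg = message_from_string(raw_message)
    # Header lookup in the email package is case-insensitive.
    value = msg.get("X-No-Archive", "")
    return value.strip().lower() == "yes"


# Hypothetical sample article, not taken from the thread.
sample = (
    "From: Example Poster <poster@example.invalid>\n"
    "Newsgroups: alt.html\n"
    "Subject: test\n"
    "X-No-Archive: yes\n"
    "\n"
    "Body text.\n"
)

if __name__ == "__main__":
    print(is_no_archive(sample))
```

As noted above, this only covers the original article; a reply that quotes the whole post carries its own headers.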
| |
| Barbara de Zoete 2005-09-10, 7:29 pm |
| On Sat, 10 Sep 2005 16:31:56 -0400, TJ <none@noemail.invalid> wrote:
> In news:opswwkuxxyx5vgts@zoete_b,
> Barbara de Zoete <b_de_zoete@hotmail.com> typed:
>
>
> Why? You personally are either a "member" of an ISP that includes Usenet
> newsgroups as part of your service, or you subscribe to an outside source.
>
>
> Sure ya do. Be it ISP, news provider, or Google Groups, one is required
> to be a 'member' to participate.
>
I'm not sure. I just sense a difference in having an ISP provide newsgroups, a
newsserver provide newsgroups or Google provide not only newsgroups but also an
interface to participate in them (Google being the example here, since there are
others) and some subscription mechanism with it.
--
,-- --<--@ -- PretLetters: 'woest wyf', met vele interesses: ----------.
| weblog | http://home.wanadoo.nl/b.de.zoete/_private/weblog.html |
| webontwerp | http://home.wanadoo.nl/b.de.zoete/html/webontwerp.html |
|zweefvliegen | http://home.wanadoo.nl/b.de.zoete/html/vliegen.html |
`-------------------------------------------------- --<--@ ------------'
| |
|
| In news:opswwmgvamx5vgts@zoete_b,
Barbara de Zoete <b_de_zoete@hotmail.com> wrote:
> On Sat, 10 Sep 2005 16:31:56 -0400, TJ <none@noemail.invalid> wrote:
>
>
> I'm not sure. I just sense a difference in having an ISP provide
> newsgroups, a newsserver provide newsgroups or Google provide not
> only newsgroups but also an interface to participate in them (Google
> being the example here, since there are others) and some subscription
> mechanism with it.
There's a BIG difference. And I don't blame you for having a problem with
posts from Google Groups. Who do you think took the overflow when AOL
discontinued Usenet service a while ago? ;)
| |
| David Dorward 2005-09-10, 7:29 pm |
| Barbara de Zoete wrote:
> I'm not sure. I just sense a difference in having an ISP provide
> newsgroups, a newsserver provide newsgroups or Google provide not only
> newsgroups but also an interface to participate in them
Demon (and most others) provide an interface to participate in them - an
nntp server.
> some subscription mechanism with it.
Like just about any other provider of Usenet access.
--
David Dorward <http://blog.dorward.me.uk/> <http://dorward.me.uk/>
Home is where the ~/.bashrc is
| |
| Greg N. 2005-09-10, 7:29 pm |
| Barbara de Zoete wrote:
> I don't like the 'get to be a member and participate' bit of Google
> Groups. You don't need to be a member to participate in usenet.
Many old-fashioned usenet servers require you to be some kind of member,
either through an ISP contract or through other means of registration.
For many usenet servers, you even need to pay extra.
> Where did Google get the idea that they can use content
> created by _me_ and you and others to create a member
> database they call their own? That's no good.
Where did the old fashioned news servers get the idea that they can use
content created by _me_ and you and others and then charge you and me
and others to get access? That is worse, no?
--
Gregor's Motorradreisen:
http://hothaus.de/greg-tour/
| |
| Anonymous 2005-09-10, 7:29 pm |
| In article <ABBUe.128$Xy2.118@newsfe7-win.ntli.net>,
"Nospam" <nospam@home.com> had this to say:
>I am looking for a program out there that can extract posts of a particular
>author on google groups and save them in one txt file hopefully without any
>replies, is such a prog available?
>
>any help will be greatly appreciated
I think you might be lucky enough to accomplish that manually using the Google
Groups advanced search (http://groups.google.com/advanced_search).
If your "particular author" always signed posts using the same "sig" line, then
you can just type or paste it into the "exact phrase" field. Doing that should
return most if not all of that author's messages and quoted replies.
But remember, if the author ever used the X-No-Archive directive, then those
posts won't show up in the archives.
If there's an actual script or dedicated programme for automatically searching
out and extracting user-designated posts, I'm guessing that the folks at Google
are keeping that a very tightly guarded secret, to prevent potential abuse.
Ckip
---
-=-
This message was sent via two or more anonymous remailing services.
| |
| Charles Sweeney 2005-09-10, 11:17 pm |
| Anonymous wrote
> In article <ABBUe.128$Xy2.118@newsfe7-win.ntli.net>,
> "Nospam" <nospam@home.com> had this to say:
>
> I think you'd be lucky enough to accomplish that manually using
> the Google
> groups advanced search here: http://groups.google.com/advanced_search
> .
>
> If your "particular author" always signed posts using the same
> "sig" line, then
> you can just type or paste it into the "exact phrase" field. Doing
> that should return most if not all of that author's messages and
> quoted replies.
Or just search on the author's name. (Potentially tricky if it's a
common name!)
--
Charles Sweeney
http://CharlesSweeney.com
| |
| +@+.+ 2005-09-10, 11:17 pm |
|
Barbara de Zoete says...
>Besides this being off-topic
Off-topic where? You posted to alt.www.webmaster,
alt.websites, alt.internet.search-engines, and alt.html
+
| |
| JennyLin 2005-09-10, 11:17 pm |
|
"Charles Sweeney" <me@charlessweeney.com> wrote in message
news:Xns96CDF2ECD2151mecharlessweeneycom@130.133.1.4...
> Anonymous wrote
>
and maybe I should have read through this thread before posting my 'aww what
have you done'.
At least I understand now, I had no idea my posts would end up on google
searches....... live and learn huh
Jenny
| |
| Guy Macon 2005-09-10, 11:17 pm |
|
Toby Inkster wrote:
>Google creates a copy of your post on their server; so do thousands of
>other servers. The other servers only keep their copies for a week or so;
>Google keeps their copy for a lot longer.
Supernews keeps posts to text newsgroups for well over a year.
| |
| Nospam 2005-09-11, 4:15 am |
|
"Charles Sweeney" <me@charlessweeney.com> wrote in message
news:Xns96CDF2ECD2151mecharlessweeneycom@130.133.1.4...
> Anonymous wrote
>
>
> Or just search on the author's name. (Potentially tricky if it's a
> common name!)
>
> --
> Charles Sweeney
> http://CharlesSweeney.com
The author has a particular signature, and I can isolate all his posts via a
search on the author. I am then left with about 1,500 posts, all from that
author. The question is: how do I extract them all into one text file
without manually going through them with copy and paste?
| |
| Paul N Burke 2005-09-11, 4:15 am |
| On Sun, 11 Sep 2005 02:37:22 GMT, "Nospam" <nospam@home.com> wrote:
>
>"Charles Sweeney" <me@charlessweeney.com> wrote in message
>news:Xns96CDF2ECD2151mecharlessweeneycom@130.133.1.4...
>
>The author has a particular signature and I can isolate all his posts via a
>search of the author, I am then left with about 1,500 posts all uniquely
>from the author the question is how do I extract them all into one text
>file, without manually going through them with copy and paste?
>
Back to my original question...
Do you have the author's permission to do that?
Does he know you are doing this?
Have you spoken to him/her about it?
| |
| Anonymous 2005-09-11, 4:15 am |
| On 10 Sep 2005, Charles Sweeney <me@charlessweeney.com> wrote:
>Anonymous wrote
>
>
>Or just search on the author's name. (Potentially tricky if it's a
>common name!)
Yes, that's a good point. If that were the case, and the author had
always signed their posts with "Jane," or "Joe," or something really common
like that, then the author would have to have shown some other predictable
and distinguishing characteristic to his/her writing style that would
make their articles identifiable in a broad search of the groups.
I would guess that if the author had a favorite group that they usually
posted to, then one could quickly narrow the search by confining it to that
group or perhaps group hierarchy, using wild cards like "alt.internet.*"
That would surely narrow down the list of Janes or Joes in the archive.
The problem is that many authors have changed their posting email
address or alias multiple times over the years. If the searcher has a
complete list of these aliases, then they could do the search effectively
just by checking for each alias or email address in the "author" field.
If that didn't fit the bill, then maybe the author had consistently
misspelled a commonly used word, like "computor," or "sophtware?" Or maybe
they had written commonly used phrases in a uniquely uncommon way. But
unless they did something really odd like that, I doubt that anyone would
have much luck searching out their archived articles if no particular
group had been that author's usual haunt, which would be rare in any case.
It seems that authors who want their posts to be more readily searchable
have used uncommon names, or maybe unusual combinations of otherwise common
names, like "Joe Jane," or something. Not too many Joe Janes out there,
I suspect. And most authors tend to post to the same groups over time.
Ckip
---
-=-
This message was sent via two or more anonymous remailing services.
| |
|
| On Sun, 11 Sep 2005 04:37:22 +0200, Nospam <nospam@home.com> wrote:
> The author has a particular signature and I can isolate all his posts
> via a search of the author, I am then left with about 1,500 posts all
> uniquely
> from the author
Plus all misquoted replies probably.
Best,
Borek
--
http://www.chembuddy.com - chemical calculators for labs and education
BATE - program for pH calculations
CASC - Concentration and Solution Calculator
pH lectures - guide to hand pH calculation with examples
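
The point about misquoted replies can be handled by matching on the From header instead of on the signature text, since a reply that quotes the signature still comes from a different address. A rough sketch, assuming the search results have already been saved locally as raw articles with headers (the function name and sample addresses are made up for illustration). Passing several addresses covers an author who changed aliases over the years:

```python
from email import message_from_string
from email.utils import parseaddr


def is_from_author(raw_message: str, author_addresses: set[str]) -> bool:
    """Return True if the article's From address is one of the author's known addresses."""
    wanted = {address.lower() for address in author_addresses}
    msg = message_from_string(raw_message)
    # parseaddr splits '"Name" <user@host>' into (name, address).
    _name, address = parseaddr(msg.get("From", ""))
    return address.lower() in wanted


# Hypothetical samples: an original post, and a reply that quotes its signature.
original = (
    'From: "Jane Example" <jane@example.invalid>\n'
    "Subject: hi\n\nHello.\n-- \nJane\n"
)
reply = (
    "From: Someone Else <other@example.invalid>\n"
    "Subject: Re: hi\n\n> Hello.\n> -- \n> Jane\nReply text.\n"
)

if __name__ == "__main__":
    known = {"jane@example.invalid"}
    print(is_from_author(original, known), is_from_author(reply, known))
```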
| |
| Charles Sweeney 2005-09-11, 11:23 pm |
| Paul N Burke wrote
> On Sun, 11 Sep 2005 02:37:22 GMT, "Nospam" <nospam@home.com> wrote:
>
> Back to my original question ,..............
> Have you the authors permission to do that ?
What's it got to do with you?
--
Charles Sweeney
http://CharlesSweeney.com
| |
| Luigi Donatello Asero 2005-09-11, 11:23 pm |
|
"Charles Sweeney" <me@charlessweeney.com> wrote in message
news:Xns96CFA678400Cmecharlessweeneycom@130.133.1.4...
> Paul N Burke wrote
>
>
> What's it got to do with you?
Don't you want to ask him or her?
--
Luigi Donatello Asero
(sono italiano ma vivo in Svezia)
(я итальянец но я живу в Швеции )
( 我是 意大利人 , 但是 我 主 在 瑞典)
(minä olen Italian kansalainen, mutta minä asun Ruotsissa)
https://www.scaiecat-spa-gigi.com/i...rg/amundoen.php
| |
| Paul N Burke 2005-09-11, 11:23 pm |
| On 12 Sep 2005 00:01:15 GMT, Charles Sweeney <me@charlessweeney.com>
wrote:
>Paul N Burke wrote
>
<snip>
>
>What's it got to do with you?
Because it was a valid question. Unlike yours.
--
http://www.FirstPeople.us/
Free "Native American" Clipart, Wallpaper Pictures, Legends, Wisdom, Artwork for sale. Safe Surf Rated
| |
| Nospam 2005-09-12, 7:28 am |
|
"Anonymous" <BigappleRemailer@bigapple.yi.org> wrote in message
news:Z4UAXFE338606.1136342593@anonymous...
> On 10 Sep 2005, Charles Sweeney <me@charlessweeney.com> wrote:
>
> Yes, that's a good point. If that were the case, and the author had
> always signed their posts with "Jane," or "Joe," or something really common
> like that, then the author would have to have shown some other predictable
> and distinguishing characteristic to his/her writing style that would
> make their articles identifiable in a broad search of the groups.
>
> I would guess that if the author had a favorite group that they usually
> posted to, then one could quickly narrow the search by confining it to that
> group or perhaps group hierarchy, using wild cards like "alt.internet.*"
> That would surely narrow down the list of Janes or Joes in the archive.
>
> The problem is that many authors have changed their posting email
> address or alias multiple times over the years. If the searcher has a
> complete list of these aliases, then they could do the search effectively
> just by checking for each alias or email address in the "author" field.
>
> If that didn't fit the bill, then maybe the author had consistently
> misspelled a commonly used word, like "computor," or "sophtware?" Or maybe
> they had written commonly used phrases in a uniquely uncommon way. But
> unless they did something really odd like that, I doubt that anyone would
> have much luck searching out their archived articles if no particular
> group had been that author's usual haunt, which would be rare in any case.
>
> It seems that authors who want their posts to be more readily searchable
> have used uncommon names, or maybe unusual combinations of otherwise common
> names, like "Joe Jane," or something. Not too many Joe Janes out there,
> I suspect. And most authors tend to post to the same groups over time.
>
> Ckip
>
> ---
>
>
>
>
> -=-
> This message was sent via two or more anonymous remailing services.
>
There is no problem finding the author's posts. Please spare me the
netiquette lecture about reading freely available posts that are already on
Google Groups. Does anybody know of a program to combine such posts into a
single text file or similar?
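
Nobody in the thread names a ready-made program, but the combining step itself is small once the posts are on disk. The sketch below is only an illustration under stated assumptions: each post has already been saved as a raw single-part article (headers, blank line, body) in its own .txt file in one directory, and how the posts get downloaded is left open. The function names and the directory layout are hypothetical. It drops quoted lines (those starting with ">") and everything after a signature separator, then writes what is left into one text file:

```python
from email import message_from_string
from pathlib import Path


def own_text(body: str) -> str:
    """Keep only the poster's own lines: drop quoted lines and the signature."""
    kept = []
    for line in body.splitlines():
        # The signature separator is "-- "; a bare "--" is accepted too,
        # since many posters leave off the trailing space.
        if line.rstrip() == "--":
            break
        if line.lstrip().startswith(">"):
            continue
        kept.append(line)
    return "\n".join(kept).strip()


def combine_posts(post_dir: Path, out_file: Path) -> int:
    """Write the cleaned body of every saved post into out_file; return the count."""
    count = 0
    with out_file.open("w", encoding="utf-8") as out:
        for path in sorted(post_dir.glob("*.txt")):
            if path.resolve() == out_file.resolve():
                continue  # do not read the output file back in
            raw = path.read_text(encoding="utf-8", errors="replace")
            msg = message_from_string(raw)
            if msg.is_multipart():
                continue  # this sketch only handles plain single-part articles
            body = own_text(msg.get_payload())
            out.write(f"Subject: {msg.get('Subject', '')}\n")
            out.write(f"Date: {msg.get('Date', '')}\n\n")
            out.write(body + "\n\n" + "=" * 60 + "\n")
            count += 1
    return count
```

Calling combine_posts(Path("posts"), Path("combined.txt")) would then produce a single file with one cleaned post after another. Attribution lines such as "So-and-so wrote:" are not removed by this sketch.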
| |